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Method for the expression of unknown environmental DNA into adapted host cells 

Introduction and Background 

5 The present invention relates to methods and compositions for nucleic acid production, 
analysis and cloning. The present invention discloses tools and methods for the 
production and analysis of libraries of polynucleotides, particularly metagenomic 
libraries, which can be used to identify novel pathways, novel enzymes and novel 
metabolites of interest in various areas, including pharmaceutical, cosmetic, 
10 agrochemical and/or food industry. 

Drug discovery process is based on two main fields, namely combinatorial chemistry 
and natural products. Combinatorial chemistry has shown its ability to generate huge 
amounts of molecules, but with limited chemical diversity. At the opposite, natural 
15 products have been the most predominant source of structural and molecular diversity. 
However, the exploitation of this diversity is strongly hampered by their limited access, 
complex identification and purification processes, as well as by their production. 

Microorganisms are known to synthesize a large diversity of natural compounds which 
20 are already widely used in therapeutic, agriculture, food and industrial areas. However, 
this promising approach to the identification of new natural compounds has always been 
considerably limited by the principal technological bolts of isolating and in vitro 
propagating the huge diversity of bacteria. Most microorganisms living in a natural, 
complex environment (soil, digestive tract, sea, etc.. .) have not been cultivated because 
25 their optimal living conditions are either unknown or difficult to reproduce. Numbers of 
scientific publications relate this fact and it is now assumed that less than about 1% of 
the total bacterial diversity (when all environments are considered together) have been 
isolated and cultivated (Amann et al, 1995). 
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New approaches have been developed to try to overpass the critical step of isolation, and 
to access directly to the huge genetic potential established by the microbial adaptation 
processes ihrough their long evolution. These approaches are called "Metagenomic" 
5 because they address a plurality of genomes of a whole bacterial community, without 
any distinction (metagenome). 

Metagenomics involve direct extraction of DNAs from environmental samples and their 
propagation and expression into a cultivated host cell, typically a bacteria. Metagenomic 

10 has been firefly developed for the identification of new bacterial phylum (Pace et 
a/.1997). This use is based on the specific cloning of genes recognized for their interest 
as phylogenetic markers, such as 16S rDNA genes. Further developments of 
Metagenomics relate to the detection and cloning of genes coding for proteins with 
environmental or industrial interest These first two applications of metagenomic involve 

15 a first step of selection of genes (generally using PCR) before cloning. In the case of 
protein production, the cloning vector used are also expression vectors, i.e., they contain 
regulatory sequences upstream of the cloning site causing expression of the cloned gene 
in a given bacterial host strain. 

More recent developments of metagenomic consider the total metagenome cloned 
20 without any selection and/or identification, to establish random "Metagenomic DNA 
libraries". This provides an access to the whole genetic potential of bacterial diversity 
without any "a priori" selection. Metagenomic DNA libraries are composed of hundreds 
of thousands of clones which differ from each other by the environmental DNA 
fragments which have been cloned. In this respect, large DNA fragments have been 
25 cloned (up to 40 Kb), so as to (i) limit the number of clones which have to be analysed 
and (ii) to be able to recover whole biosynthetic pathways. This last point is of particular 

interest for bacterial metagenomic-libraries since, most of the time, the biosynthetic 

pathways have been found to be naturally organised in a same cluster of DNA and even 
in the same operon in bacteria. 
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Except for the identification and characterisation of bacterial community at the 
phylogenetic or diversity levels, metagenomic libraries produced in the prior art are gene 
expression libraries, i.e., me environmental DNA fragments are cloned downstream of a 
5 functional promoter, to allow their expression and analysis. In this regard, W099/45154 
and W096/34112 relate to combinatorial gene expression libraries which comprise a 
pool of expression constructs where each expression construct contains DNA whicb is 
operably associated with one or more regulatory regions that drive expression of genes 
in an appropriate organisms. Furthermore, the expression constructs used in these 
10 methods have a very limited and invariable host range. Similarly, WO 01/40497 relates 
to the construction and use of expression vectors which can be transferred in one chosen 
expression bacterial host of the Streptomyces genus. All these approaches are, however, 
very limited since they require the presence of expression signals and confer invariable 
or very limited host range capabilities. Furthermore, most (if not all) metagenomic DNA 
15 libraries are established in E. coli which is the most efficient cloning system. However, 
most environmental DNA are not expressed or frmctionally active in E. coli. In 
particular, functional analysis in E. coli of genes cloned from G+C rich organisms, such 
as Actinomyces, could be limited by the lack of adequate transcription and translation 
system. Also, posttranslational modification system in E. coli is not operative on 
20 heterologous proteins from Actmomicetes and some specific substrates for proteins 
activity are not present in E. coli. 

Thus, the exploitation of the huge potential of metagenomics for the discovery of new 
natural compounds, pathways or genes cannot be achieved with currently existing 
25 methods. Alternative technologies and processes must be developed, to allow production 
of efficient libraries and functional screening in a large variety of host cell species, 
including Bacillus or Streptomyces, to take full account of the huge diversity of the 
environmental DNAs. 
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Summary of the Invention 

The present invention discloses improved tools and methods for the production and 
analysis of libraries of polynucleotides, particularly metagenomic libraries, which can be 
5 used to identify novel pathways, novel enzymes and novel metabolites of interest. 

More particularly, the invention now proposes to keep the advantage of high efficient 
cloning in E. coli and to modify the properties of metagenomic libraries, to allow 
functional analysis of particular selected clones in any appropriate system, thereby 
10 making possible the analysis of the huge diversity of metagenomic libraries. According 
to the invention, polynucleotide libraries can be produced in any convenient cloning 
system, such as E. coli, and then modified, depending on the desired selection or 
screening system, to adapt host range and/or properties of the library (or a portion 
thereof). 

15 

A particular object of this invention resides more specifically in a method of analysing a 
library of polynucleotides, said polynucleotides being contained in cloning vectors 
having a particular host range, the method comprising (i) selecting cloning vectors in the 
library which contain a polynucleotide having a particular characteristic, (ii) modifying 
20 said selected cloning vectors to allow a transfer of said vectors and/or expression of the 
polynucleotide which they contain into a selected host cell, and (iii) analysing the 
polynucleotides contained in said modified vectors upon transfer of said modified 
vectors into said selected host cell. 

25 An other object of this invention is a library of polynucleotides, wherein said library 
comprises a plurality of environmental DNA fragments cloned into cloning vectors, 

wherein said environmental DNA fragments contain a common "mbleculaf characteristic 

and wherein said cloning vectors are E. coli cloning vectors comprising a target 
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polynucleotide construct allowing transfer and/or expression of the environmental DNA 
into a selected host cell distinct from E. colL 

A further object of this invention is a method of producing modified libraries of 
5 polynucleotides, the method comprising selecting a sub-population of clones in a first 
library, based on the presence or absence of a characteristic of interest, and modifying 
the properties of said selected clones to allow their functional analysis or expression. 

The modification in the library or cloning vector is typically obtained by targeted 
10 insertion of a polynucleotide construct, preferably using transposable elements, either in 
vitro or in vivo. 

The invention is particularly suited for producing and analysing genetic diversity 
(metagenomic libraries), to identify and isolate new metabolites, drugs, enzymes, 
15 antibiotics, etc. 

Legend to the Figures 

Figure 1 : Map of pPl vector carrying transposable construct Tn<Apra> fig. la. Excised 
20 transposable construct is 1 132 bps in size. It contains two mosaic ends (ME) and a gene 
conferring resistance to apramycine (Apra) fig. lb. 

Figure 2 : pPLl Vector (fig. 2a) that carries the conjugative transposable construct 
Tn<Apra-oril> (fig. 2b). The nucleotide sequence of the transposable construct contains 
25 an origin of transfer (oriT) and a gene conferring resistance to apramycine (Apra). 
Direction of DNA transfer at oriT is shown by an arrow. 

Figure 3 : Transposable construct Tn<Apra-oriT-att-int> (fig. 3a) on vector pPAOI6 (fig. 
3b). The transposable construct contains OC31 integrase gene and attachment DNA 
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sequence for site specific integration, origin of transfer and gene for selection. The 
orientation of genes and direction of DNA transfer are marked by arrows. 

Figure 4 : This figure shows any target DNA suitable for insertion of transposable 
5 construct (fig. 4a). Insertion of conjugative and site specific integrative transposable 
construct Tn<Apra-oriT-int> is shown on fig. 4 b and c. Insertion of transposable 
construct into selective gene marker carried on original vector is shown on fig. 4b. In 
this event, the cloned insert is intact and can be transferred to heterologous host. 
Insertion of transposable construct into cloned DNA insert results in gene inactivation 
10 (fig. 4c). 

Figure 5 : Complete annotated DNA sequences of clones FS3-124 (a, SEQ ID NO: 1) 
and FS3-135 (b, SEQ ID NO: 2). 

15 Figure 6 : Morphological differences between Streptomyces transconjugant (assay). 
Conjugations have been performed with FS3-124 modified with transposable construct 
pPAOI6; (control) conjugation have been performed with pPAOI6. 

Detailed Description of the Invention 

20 

The invention provides novel strategies, methods and products for generating and 
analysing combinatorial gene libraries. As indicated above, the invention discloses, 
particularly, methods of analysing libraries of polynucleotides, said polynucleotides 
being contained in cloning vectors having a particular host range, the methods 
25 comprising (i) selecting cloning vectors in the library which contain a polynucleotide 

having a-particular- characteristic, (ii) modifying- said selected-cloning vectors to. allow.a .._ 

transfer of said vectors and/or expression of the polynucleotide which they contain into a 
selected host cell, and (iii) analysing the polynucleotides contained in said modified 
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vectors upon transfer of said modified vectors into said selected host cell, such as by 
genetic, biochemical, chemical or phenotypical approaches. 

Library of polynucleotides 

5 

The term "library of polynucleotides" designates a complex composition comprising a 
plurality of polynucleotides, of various origins and structure. Typically, the library 
comprises a plurality of unknown polynucleotides, i.e., of polynucleotides whose 
sequence and/or source and/or activity is not known or characterized. In addition to such 

10 unknown (or uncharacterised) polynucleotides, the library may further include known 
sequences or polynucleotides. Typically, the library comprises more than 20 distinct 
polynucleotides, more preferably at least 50, typically at least 100, 500 or 1000. The ; 
complexity of the libraries may vary. In particular, libraries may contain more than 5000, 
10 000 or 100 000 polynucleotides, of various origin, source, size, etc. Furthermore, the 

15 polynucleotides are generally cloned into cloning vectors, allowing their maintenance 
and propagation in suitable host cells, typically in E. colu The polynucleotides in the 
library may be in the form of a mixture or separated from each other, in all or in part. It 
should be understood that some or each polynucleotide in the library may be present in 
various copy numbers. 

20 

The polynucleotides in the libraries are more preferably obtained or cloned from 
complex sources of nucleic acids, most preferably from environmental samples. Such 
libraries are also termed << metagenomic libraries" since they contain nucleic acids 
derived from whole genomes of complex mixtures of microorganisms. 

25 

The term environmental sample designates, broadly, any sample containing (a plurality 
of) uncharacterized (micro)organisms, particularly uncultivated (or non-cultivable) 
microorganisms. The sample may be obtained or derived from specific organisms, 
natural environments or from artificial or specifically created environments (e.g., 
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industrial effluents, etc). An uncultivated (or non-cultivable) microorganism is a 
microorganism that has not been purposely cultured and expanded in isolated form. The 
sample may be obtained or derived from soil, water, mud, vegetal extract, wood, 
biological material, marine or estuarine sediment, industrial effluents, gas, mineral 
5 extracts, sand, natural excrements, meteorits etc. The sample may be collected from 
various regions or conditions, such as tropical regions, deserts, volcanic regions, forests, 
farms, industrial areas, household, etc. 

Environmental samples usually contain various species of (uncharacterized, 
10 uncultivated) microorganisms, such as terrestrial microorganisms, marine 
microorganisms, salt water microorganisms, freshwater microorganisms, etc. Species of 
such environmental microorganisms include autotrophe or heterotrophe organisms, 
Eubacteria, archaebacteria, algae, protozoa, fungi, viruses, phages, parasites, etc. The 
microorganisms may include extremophile organisms, such as thermopbiles, 
15 psychrophiles, psychrotophes, acidophiles, halophiles, etc. More specific examples of 
environmental bacteria includes actinomycetes, eubacteriaes and mycobacteriaes, 
examples of fungi include phycomycetes, ascomycetes and basidiomycetes, etc. Other 
organisms include yeasts (saccharomyces, kluyveromyces, etc.) plant cells (algae, 
lichens, etc), corals, etc. for instance. The sample may comprise various species of such 
20 donor (uncultivated) microorganisms, as well as various amounts thereof The 
environmental sample may contain, in addition, known and/or cultivable 
microorganisms (e.g., prokaryotic or eukaryotic), as well as nucleic acids and organic 
materials. The sample may also contain mammalian cells or animal cells or insect cells 
(arising from larvae, feces, etc.). 

25 

It should be understood that the present invention is not limited to any specific type of 
sample or environmental microorganism. Tout can be used to produce diversity, create 
nucleic acid libraries, etc., from any environmental sample comprising uncultivated 
microorganisms. The sample may be wet, soluble, dry, in the form of a suspension, 
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paste, powder, solid, etc. Preferably, the sample is dry or in solid or semi-solid state 
(e.g., paste, powder, mud, gel, etc.). The sample may be treated prior to nucleic acid 
extraction, for instance by washings, filtrating, centrifuging, diluting, drying, etc. 

5 The term "environmental DNA" designates any DNA fragment or collection obtained 
from an environmental sample. Nucleic acids may be extracted / isolated from the 
sample according to various techniques, such as those described in WO01/81357, in 
WO01/40497, in Handelsman et al (Chemistry & Biology 5(10), 1998, R245), Rondon 
et al (Tibtech 17, 1999, 403 ; Applied and Environm. Microbiol. 66, 2000, 2541), Miller 

10 et al (Applied and Environm. Microbiol. 65, 1999, 4715) or Frostegard et al (Applied 
and Environm. Microbiol., 65, 1999, 5409). 

In a particular embodiment of the above method, the library comprises a plurality of 
environmental DNA fragments. The library may also comprise other types of nucleic 
1 5 acids, such as environmental RNAs, for instance. 

The polynucleotides have a size which is typically comprised between 10 and 100 kb, 
more preferably between 20 and 80 kb, typically between 30 and 80 kb. Although not 
mandatory, it is preferred that the polynucleotide fragments in the library all have similar 
20 size, to produce homogenous libraries. 

Cloning Vectors 

As indicated, the polynucleotides are contained or cloned into cloning vectors. These 
25 vectors may be of various types, including plasmids, cosmids, fosmids, episomes, 
artificial chromosomes, phages, viral vectors, etc. In most preferred embodiments, the 
cloning vectors are selected from plasmids, cosmids, phages (e.g., PI -derivatives) and 
BACs, even more preferably from cosmids, PI derivatives and BACs. By using cosmids 
or PI derivatives, it is possible to generate homogenous libraries, since these vectors 
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essentially accommodate polynucleotides having a size of approximately 40 kb and 80 
Kb, respectively. Furthermore, since the invention provides that the vectors are modified 
after the initial cloning step, the cloning capacity of the vectors is maximized and inserts 
of as much as 40 and 80 kbs in length can be cloned into cosmids and PI derivatives. 

5 

As indicated the cloning vector has a particular host range, i.e., the ability to replicate in 
a particular type of host cell. Typically, the host is a bacteria, more preferably an E. coli 
strain. Indeed, K coli is so far the most convenient host cell for performing recombinant 
technologies. The advantage of the present invention is that the starting library can be 
1 0 produced in any suitable host system of choice, since the properties of the libraries will 
be adapted later during the process. 

Cloning vectors generally comprise the polynucleotide insert and genetic elements 
necessary and sufficient for maintenance into a competent host cell. They typically 

15 contain, in addition to the polynucleotide insert, an origin of replication functional in a 
selected host cell as well as a marker gene for selection and screening. The cloning 
vector may comprise additional elements, such as promoter regions, for instance. 
Although cloning vectors may replicate in several different host cells, they are usually 
adapted to a particular host cell type and not suitable or efficient for replication or 

20 maintenance in other cell types. 

In a preferred embodiment, the cloning vectors of the library are E. coli cloning vectors, 
preferably cosmids , BAC or PI derived vectors. E. coli cloning vectors may carry an 
origin of replication derived from naturally-occurring plasmids, such as ColEl, pACYC 
25 and pl5A> for instance. Many E.coli cloning vectors are commercially available and/or 
can be constructed using available regulatory sequences. 



Screening of the clo ninfi vectors 
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In step i) of the method, a first selection or screen is performed on the polynucleotide 
library. The screen is performed so as to identify or select clones having (or lacking) a 
particular, common characteristic. The selection may be carried out according to various 
techniques, such as by molecular screening, protein expression, functional screening, etc. 

5 A preferred selection is performed by molecular screening. Molecular screening 
designates any method of identification of molecular or structural characteristics in a 
polynucleotide sequence. This can be made by a variety of techniques which are known 
per se, such as hybridisation, amplification, sequencing, etc. Preferably, molecular 
screening comprises the selection of clones in a library which contain, in their sequence, 

10 a particular sequence or region or motif* said sequence or region or motif being 
characteristic of a particular type of activity or gene (enzyme, biosynthetic pathways, 
etc.). 

In a first variant, the selection is made by contacting the cloning vectors in the library 
15 with a particular nucleic acid probe (or set of probes) containing a sequence which is 
characteristic of a selected activity or function (a consensus sequence, a particular niotif, 
etc.). The cloning vectors in the library which hybridise to the probe (or set of probes) 
are then selected. 

20 In a second variant, the selection is made by contacting the cloning vectors in the library 
with a particular pair of nucleic acid primers specific for a sequence which is 
characteristic of a selected activity or function (a consensus sequence, a particular motif, 
etc.), and a PCR amplification reaction is performed. The cloning vectors in the library 
which lead to a positive amplification product are then selected. 

25 

In this regard, the present application provides new primers designed in conserved 
motives of the p-keto acyl synthase gene, which are particularly useful for screening 
polynucleotides containing putative PKS genes or domains. These primers have the 
following degenerated sequence : 



12 



Sense primer : 5>- GGSCCSKCSSTSDCSRTSGAYACSGC -3* (SEQ ID NO: 3) 
Antisense primer : 5'- GCBBSSRYYTCDATSGGRTCSCC -3> (SEQ ID NO: 4) 
wherein : 
Ris AorG 
5 S is G or C 
YisCorT 
KisGorT 
Dis AorGorT,and 
BisCorGorT. 

10 

A particular object of this invention is a polynucleotide primer having one of the above 
sequences, typically a mixture of different polynucleotide primers having a sequence 
corresponding to one of the above degenerated sequences. A particular object of this 
invention also resides in a pair of primers each having one of the above sequences. 

15 

Once particular clones have been selected, the analysis of their polynucleotides needs to 
be confirmed and/or validated, and/or their polynucleotides can be used to produce novel 
compounds or metabolites. 

20 The invention now enables such further analysis and uses, by allowing a modification of 
the cloning vectors that is specific and adaptable by the skilled person, depending on the 
activity which is sought. In particular, it is possible to confer properties such as specific 
expression or a novel, specific host range to the selected cloning vectors, to assess their 
activity, as disclosed below. 

25 

Modification of the cloning vectors 



After high efficiency cloning using most convenient cloning vectors such as BACs or 
cosmids propagated into E. coli, and after the identification, selection and/or 
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characterisation of cloned DNA fragments, the invention now allows to modify 
specifically the cloning vectors to transfer, maintain, express and/or over-express the 
selected polynucleotides into any selected host expression system, which is suitable to 
assess the selected activity or property. Such selected hosts may he native or 
5 heterologous host cells, and include, but are not limited to, for example Streptomyces, 
Nocardia, Bacillus, fungi, yeasts, etc. 

The selected cloning vectors of the library may be modified according to various 
techniques. The modification is typically a genetic modification, comprising the 

10 introduction of particular genetic sequences into the structure of the cloning vector, in 
addition to or in replacement of sequences contained in said vector. It is highly preferred 
to use specific or targeted (or oriented) techniques to improve the efficacy of the method. 
By "specific" is meant that the modification occurs at a pre-determined location in the 
cloning vector, through site-specific mechanisms. By 'targeted" is meant that the 

15 modification occurs in a controlled way, so as not to alter the polynucleotide insert 
contained in the vector in a non-desirable way. 

In a preferred embodiment, the selected vectors are modified by insertion, into the 
vector, of a target polynucleotide construct which contains genetic elements conferring 
20 the selected property(ies) to the cloning vector. 

The target polynucleotide construct typically comprises the genetic elements necessary 
to transfer, propagate, maintain , express or overexpress the cloned polynucleotide into a 
chosen (bacterial) host expression system. Said genetic elements may include particular 
25 origin(s) of replication, particular origin(s) of transfer, particular integrase(s), 
transcriptional promoters) or silencer(s), either alone or in combinations). 

In a first, preferred variant, the target polynucleotide construct comprises a genetic 
element allowing transfer of the vector into a selected host cell. 
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Natural DNA .transfer mechanisms between donor and recipient strains is known under 
the term conjugation or conjugative transfer. Conjugative transfer can occur between 
different strains of the same species as well as between strains of different species. Many 
5 naturally occurring plasmids carry so called tra genes, which are involved in and 
mediate conjugative transfer. The DNA transfer starts at specific DNA structures, known 
as an origin of transfer or "ori T". The presence of such an oriT in a vector allows said 
vector to be transferred into a desired host cell. 



10 In a particular, preferred embodiment, the target polynucleotide construct comprises an 
origin of transfer functional in the selected host cell, more particularly in a Streptomyces 
ox Bacillus host cell. 



The structure of various oriT has been reported in the art (Guiney et al. 1983; Zechner et 
15 al. 2000). In a specific embodiment, the origin of transfer is selected (or derived) from 
RP4, pTiC58, F, RSF1010, ColEl and R6K(a). 



A specific example of an oriT which can be used in the present invention derives from 
plasmid RP4 and has or comprises all or a functional part of the following sequence 
20 (SEQIDNO:5): 



gatctGTGATGTACTTCACCAGCTCCGCGAAGTCGCTCTTCTTGATTGGAGCGCATGGG 

GACGTGCTTGGCAATCACGCGCACCCCCCGGCCGTTTTAGCGGCTAAAAAAGTCATG 

GCTCTGCCCTCGGGCGGACCACGCCCATCATGACCTTGCCAAGCTCGTCCTGCTTCT 

25 CTTCGATCTTCGCCAGCAGGGCGAGGATCGTGGCATCACCGAACCGCGCCGTGCGC 
GGGTCGTCGGTGAGCCAGAGTTTCAGCAGGCCGCCCAGGCGGCCCAGGTCGCCATT 
GATGCGGGCCAGCTCGCGGACGTGCTCATAGTCCACGACGCCCGTGATTTTGTAGCC 
CTGGCCGACGGCCAGCAGGTAGGCCGACAGGCTCATGCCGGCCGCCGCCGCCTTTTC 
CTCAATCGCTCTTCGTTCGTCTGGAAGGCAGTACACCTTGATAGGTGGGCTGCCCTT 

30 CCTGGTTGGCTTGGTTTCATCAGCCATC 

GCCGGCCAGCCTCGCAG AGCAGG^A^CCCGTTCjAGCACCGCCAGGTGCGAATAAGG 

GACAGTGAAGAAGGAACACCCGCTCGCGGGTGGGCCTACTTCACCTATCCTGCCCG 

GCTGACGCCGTTGGATACACCAAGGAAAGTCTACACGAACCCTTTGGCAAAATCCT 
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GTATATCGTGCGAAAAAGGATGGATATACCGAAAAAATCGCTATAATGACCCGGAA 
GCAGGGTTATGCAGCGGAAAAGATCCGTCGGATCT 

The term "functional part" designates any fragment or variants of the above sequence 
5 which retain the capacity to cause conjugative transfer. Such fragments typically 
comprise at least 80%, preferably at least 85% or 90% of the above sequence. Variants 
may include one or several mutations, substitutions, deletions or additions of one or 
several bases. 

10 In an other particular variant, the target polynucleotide construct comprises a genetic 
element allowing integration of the vector into the genome of the selected host cell. 

A donor DNA is permanently or stably maintained and expressed in a selected recipient 
cell if it is integrated into the recipient cell's genome or if it contains elements that allow 
1 5 autonomous replication in said cell. 

Integration is a preferred way of ensuring stable expression. Integration can be obtained 
by physical recombination. Recombination can be homologous, e.g., between two 
homologous DNA sequences, or illegitimate, where recombination occurs between two 

20 non-homologous DNAs. As a particular example, integration of donor DNA into the 
chromosome of the recipient can be mediated by host recombination repair system or by 
site-specific recombination. Another well-studied process that can transfer and integrate 
genes is transduction by bacterial viruses, such as X and <|>C31. In a phage-infected 
bacterial cell, fragments of the host DNA are occasionally packaged into phage particles 

25 and can then be transferred to a recipient cell. Integration into the recipient cell's genome 
is caused by an integrase. 

In a specific embodiment, the target polynucleotide construct comprises a nucleic acid 
encoding an integrase functional in the selected host cell. More preferably, the integrase 
30 is selected from X and (|>C31 integrases. In a specific embodiment, the polynucleotide 
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construct comprises a nucleic acid encoding an integrase having or comprising all or a 
functional part of the following sequence of the <|>C31 integrase (SEQ ID NO: 6) : 

AGATCTCCCGTACTGACGGACACACCGAAGCCCCGGCGGCAACCCTCAGCGGATGC 
CCCGGGGCTTCACGTITrCCCAGGTCAGAAGCGGTITrCGGGAGTAGTGCCCCAACT 
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CCCGGGGCTTCACGT1 riCCUAUU l ^a<j/w\«jv_vjvj a i i x^wn.^ — — * - - — 

OGGGTAACCTTTGAGTTCTCTCAGTTGGGGGCGTAGGGTCGCCGACATGACACAAG 
GGGTTGTGACCGGGGTGGACACGTACGCGGGTGCTTACGACCGTCAGTCGCGCGAG 
CGCGAGAATTCGAGCGCAGCAAGCCCAGCGACACAGCGTAGCGCCAACGAAGACA 
AGGCGGCCGACCTTCAGCGCGAAGTCGAGCGCGACGGGGGCCGGTTCAGGTTCGTC 
GGGCATTTCAGCGAAGCGCCGGGCACGTCGGCGTTCGGGACGGCGGAGCGCCCGGA 
GTTCGAACGCATCCTGAACGAATGCCGCGCCGGGCGGCTCAACATGATCATTGTCTA 
TGACGTGTCGCGCTTCTCGCGCCnrGAAGGTCATGGACGCGATTCCGATrGTCTCGGA 
^^^^n^r^OTOACGATTGTTTCCACTCAGGAAGGCGTCTrCCGGCAGGG 




GGTACGTCGGCGGGAAGGCGCCTTACGGCTTCGAUCJl lot l h.uuau^u™^ 
ATCACGCGCAACGGCCGAATGGTCAATGTCGTCATCAACAAGCTTGCGCACTCGACC 
ACTCCCOTACCGGACCCTrCGAGTTCGAGCCCGACGTAATCCGGTGGTGGTGGCGT 
GAGATCAAGACGCACAAACACCTTCCCTTCAAGCCGGGCAGTCAAGCCGCCATTCA 
CCCGGGCAGCATCACGGGGCTTTGTAAGCGCATGGACGCTGACGCCGTGCCGACCC 
GGGGCGAGACGATTGGGAAGAAGACCGCTTCAAGCGCCTGGGACCCGGCAACCGTT 
ATGCGAATCCTrCGGGACCCGCGTATTGCGGGCTTCGCCGCTGAGGTGATCTACAAG 
* * HAAGCCGGACGGCACGCCGACCACGAAGATTGAGGGTTACCGCATTCAGCGCGA 
„ ™ a ^^o^^rnonTrGArTfTTTGATTGCGGACCGATCATCGAGCCCGCTGA 



In a more" preferred variant, "the target pbljnaucleotide construct comprises -genetic 
elements allowing transfer of the cloning vector into the selected host cell and 
integration of the cloning vector or a portion thereof into the genome of the selected host 
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cell. Most preferred polynucleotide constructs comprises an oriT and a nucleic acid 
encoding an integrase. 

ta an other variant, the targe, polynucleotide construct comprises an origin of replieation 
apeeifie for or functional in the selected hoat eell. The origin of replication may be 
selected (or derived), for instance, tan pAMBl, pSa, 2pm circle, pSam2, pSGl, pDTOl, 
SCP2, pA387 and artificial chromosomes. 

to an other variant, the target polynucleotide construct comprises a transcriptional 
promoter ftncttonal in the selected host cell. Aa indicated above, in a particular variant, 
the invention allo W s to modify the cloning vector to enable expression or over- 
expression of the cloned polynucleotides in the selected host. The expression of genea la 
driven mainly by tianseriptional promoters, which initiate gene transcription. The type of 
promoter to he used in the present invention can be selected by the skilled person, 
depending on the selected host cell and We of expression needed. Promoters may be 
ubiquitous or cell^pecific, regulated or constitutive, weak or strong. They may be of 
various origins, including promoters isolated from viruses, phages, plant cells, bacterial 
genes, mammalian genes, etc., or they may be artificial or chimeric. Typical examples of 
promoters include T7, T4, LacZ, trp, ara, SV40, tac, OU GAL, AOX, bsp-70, etc. 

20 

The target polynucleotide construct is typically a DNA molecule, although RNAs may 
atao be used aa starting material. It la typically a double-stranded DNA. The target 
polynucleotide construct may be produced by eonveutional recombinant DNA 
techniques, including DNA synthesis, cloning, ligation, restriction digestion, etc. and a 
25 combination thereof 

The target polynucleotide construct is preferably engineered so as to be inserted in a 
region of the vector distinct from the polynucleotide. Indeed, it is important that the 
integrity of the polynucleotides ia preserved. Directed insertion may be accomplished in 
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a variety of ways, including site-specific insertion using particular enzymatic systems 
(Cre/Lox, FLP, etc.), homologous recombination with particular target sequences present 
in the vector, or by the use of transposons or transposable elements and appropriate 
selection means. 

5 

In a particular, preferred embodiment, the target polynucleotide construct is contained in 
a transposable nucleic acid construct. Indeed, in a preferred variant, the methods of the 
present invention use transposable elements to alter the properties of the cloning vectors, 
and allow their transfer, maintenance, expression or over-expression in a selected host 
10 cell. 

Transposable nucleic acid constructs are derived from transposons, which are genetic 
elements capable of moving from one genetic loci to another. Two main classes of 
transposons have been identified in bacteria. The most simple transposons comprise an 
15 insertion sequence that carries only elements of transposition. These elements are two 
inverted DNA repeats and a gene that codes for a protein called transposase. The 
transposase catalyses the excision and integration of the transposon. It has been shown 
that the excision and integration reaction can be catalysed in trans by a transposase, 
which can be provided in vivo or in vitro in purified form or expressed from a different 
construct More complex transposons carry more insertion sequences and additional 
genes that are not involved in transposition. 

Transposable nucleic acid constructs of this invention thus typically comprise, flanked 
by two inverted repeats, the target polynucleotide construct and, more preferably, a 
marker gene. In the presence of a transposase, these transposable nucleic acid constructs 
can integrate into a cloning vector in vivo or in vitro, thereby providing for targeted 
- polynucleotide insertion: Alternatively, such' nucleic "acid constructs can be used for" 
targeted integration, in the absence of a transposase, in particular strains such as 
hypermutator strains. Such transposable nucleic acid constructs also represent a 




19 



particular object of the present application. In this regard, in a more preferred 
embodiment, the invention also relates to a transposable nucleic acid construct, wherein 
said construct comprises an origin of transfer flanked by two inverted repeats. Specific 
examples of such construct are transposons pPLl and pPAOI6, as disclosed in the 
5 experimental section. The transposable nucleic acid construct may further comprise an 
integrase gene and/or a marker gene. 

The inverted repeat nucleic acid sequences may be derived from the sequence of various 
transposons, or artificially created. In particular, transposable elements can be generated 
10 using inverted repeats obtained from transposons or transposable elements such as TnS, 
Tn21, miniTnS, T7, T10, Tn917, miniTn400, etc. Preferably, the sequences derive from 
transposon TnS. In a specific embodiment, they comprise all or a functional part of the 
following sequences : 

15 - left arm of pPAOI6 transposon (SEQ ID NO: 7) 
CTGTCTCTTATACACATCTCA 
TTGGCTCGAATTCGAGCTCGGTACCC 

- right arm of transposon pPAOI6 (SEQ ID NO: 8) 
20 GATCCTCTAGAGTCGACCrGCAGGCATGCAAGCTTGCCAACGACTACGCACTAGCCA 
ACAAGAGCTTCAGGGTTGAGATGTGTATAAGAGACAG 

The marker gene may be any nucleic acid encoding a molecule whose presence in a cell 
can be detected or visualized. Typical marker genes encode proteins conferring 
25 resistance to antibiotics, such as apramycine, chloramphenicol, ampiciline, kanamycine, 
spectinamycine, thiostrepton, etc. Other types of markers confer auxotrophy or produce a 
label (e.g., galactosidase, GFP, luciferase, etc). 

In a specific embodiment, the cloning vector in the library comprises a first marker gene 
30 and the modification step ii) comprises: 
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. contacting in vitro, in the presence of a transposase, the selected cloning vectors with a 
transposon comprising, flanked by two inverted repeats, the target polynucleotide 
construct and a second marker gene distinct from the first marker gene, and 
. selecting the cloning vectors which have acquired the second marker gene and which 
5 have lost the first marker gene. 

The double selection ensures that the target polynucleotide construct has been inserted at 
a site within the marker gene present in the cloning vector, i.e., outside of the 
polynucleotide insert. 

10 

It should be understood that the modification may be accomplished in various other 
ways, particularly by incorporating a sequence coding for the transposase directly into 
the transposable element or into another expression unit. The presence and expression of 
the transposase can be regulated by inductive promoter or termosensitive replicative 
1 5 units. Also, transposition can be carried out by in vitro process. 

Analysis of the polynucleotides 

In step (iii) of the process, the polynucleotides may be analysed by various methods, 
20 including by genetic, biochemical, chemical or phenotypical approaches, which are well 
known per se in the art Analysis occurs upon transfer and, optionally, expression of the 
polynucleotides into the selected host cell. 

In this regard, the modified cloning vectors can be transferred into the selected host cell 
25 by a variety of techniques known in the art, including by transformation, electroporation, 
transfection, protoplast fusion, conjugative transfer, etc. In a preferred embodiment, the 
target polynucleotide construct comprises an oriT -and -the-Tnodified vectors- are 
transferred into the selected host cells by conjugative transfer. In this embodiment, the 
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cloning vector and the selected host cells are co-cultivated and the recombinant host 
cells are selected and isolated. 

The selected host cell may be any type of cell or microorganism, including, without 
5 limitation, Steptomyces, E. coli, Salmonella, Bacillus, Yeast, jungi, etc. 

One of the objectives of the invention is to be able to analyse environmental DNAs of 
unknown cellular origin into different host expression systems. In order to analyse the 
potentiality of the DNAs at the transcription and/or translation levels and to have much 
10 more probabilities to have a DNA expression, it is important to have the possibility to 
test different host expression systems. 

Insertion of foreign DNA into host expression systems like Streptomyces can produce, 
for instance, an increase in doubling time, morphological modifications, pigments 
15 production, etc. which can be related either directly to the expression of foreign DNA or 
by combinatorial biology of the foreign DNA and the biology of the expression host 
systems. The new characters can be analysed by all techniques known in the art such 
genetic, biochemical, chemical, phenotypic approaches etc. 

20 In a preferred, specific embodiment, the invention relates to a method for the 
identification or cloning of polynucleotides encoding a selected phenotype, the method 
comprising (i) cloning environmental DNA fragments into E. coli cloning vectors to 
produce a metagenomic library, (ii) identifying or selecting cloning vectors in said 
library which contain DNA fragments having a particular characteristic of interest, (iii) 

25 modifying the identified or selected cloning vectors into shuttle or expression vectors for 
transfer or expression in a selected host cell, (iv) transferring or expressing the modified 
cloning vectors into said selected host cell and (v) identifying or cloning the DNA 
fragments contained in said modified cloning vectors which encode said selected 
phenotype in said selected host cell. 
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By applying the above method, new polynucleotide sequences have been identified, 
cloned and characterized, which produce new phenotypes in bacteria. These 
polynucleotides contain the sequence of PKS genes and encode polypeptides involved in 

5 biosynthetic pathways. The sequence of these polynucleotides is provided in Figure 5. 
The invention also relates to any polynucleotide sequence comprising all or part of these 
sequences (i.e., SEQ ID NOs: 1 or 2), their complementary strand, or a functional variant 
thereof. A part of the above sequences includes, preferably, at least 20 consecutive bases, 
more preferably at least 50 consecutive bases thereof, even more preferably a coding 

10 sequence (e.g., an ORF). In this respect, SEQ ID NOs: 1 and 2 comprise several novel 
open reading frames encoding novel polypeptides involved in biosynthetic pathways. 
These coding sequences are identified in Figures 5a and 5b. 

In a specific embodiment, the invention relates to a polynucleotide sequence comprising 
15 a sequence selected from nucleotides 76 - 1134 ; 1096 - 2430 ; 1178 - 1624 ; 2506 - 
3567 ; 2906 - 4222 ; 4092 - 5321 ; 6337 - 8502 ; 8181 - 9530 ; 9531 - 10721 ; 10504 - 
11274 ; 12874 - 13689 ; 14195 - 15976 ; 15427 - 16512 ; 15579 - 16253 ; 16505 - 17656 
; 17657 - 18697 ; 18615 - 19304 ; 19301 - 20596 ; 20535 - 21476 ; 22025 - 22951 ; 
23155 - 26523 ; 26409 - 34433 ; 34418 - 37500 and 35359 - 37500 of SEQ ID NO: 1 
20 (figure 5a) or a complementary strand thereof. 

In an other specific embodiment, the invention relates to a polynucleotide sequence 
comprising a sequence selected from nucleotides 3 - 914, 924 - 2168 ; 2207 - 3190 ; 
3373 - 4455 ; 4546 - 4959 ; 5176 - 6192 ; 6331 - 14043 ; 14275 - 15408 ; 15436 - 16245 
25 ; 16287 - 17384 ; 17427 - 18158 ; 18248 - 18847 ; 18952 - 20346 ; 20442 - 21167 ; 
21164 - 24301 ; 24351 - 27023 ; 27806 - 29686 ; 29535 - 30872 ; 30848 - 32647 ; 32574 
- 35555 ; 35533 - 36598 and 365T6 - 37400 of SEQ ID NO: 2 -(figure" 5b) of a 
complementary strand thereof. 




23 



The invention also relates to any vector comprising these polynucleotide sequences. 
These sequences may be DNA or RNA, preferably DNA, even more preferably double- 
stranded DNA. The invention also relates to a polypeptide encoded by a polynucleotide 
sequence as defined above, 

5 

An other object of this invention resides in a library of polynucleotides, wherein said 
library comprises a plurality of environmental DNA fragments cloned into cloning 
vectors, wherein said environmental DNA fragments contain a common molecular 
characteristic and wherein said cloning vectors are E. coli cloning vectors comprising a 
10 target polynucleotide construct allowing transfer or expression of the environmental 
DNA into a selected host cell distinct from E. colL 

The sub-DNA-libraries should have either desired genetic characteristics based on high 
or low GC content, DNA encoded for a desired enzymatic activity, part or full 

15 biosynthetic pathways for metabolites etc., or specific origin such as soil fractions, i 
animal organs, sub fraction of a microorganism community etc. The invention also 
allows to produce conjugative vector with desired characteristics in accordance with the 
characteristics of the pre-identified sub-DNA-libraries and functional analysis of mutants 
in heterologous hosts. It can also be used, without limitation, for the production of 

20 mutants by random or directed mutagenesis, for DNA sequencing, genes or biosynthetic 
pathways knock-out by deletion or insertion or to confer transfer capabilities for 
expression, co-expression, over-expression or modification of biosynthetic pathways. 

Further aspects and advantages of the invention will be disclosed in the following 
25 examples, which should be regarded as illustrative and not limiting the scope of this 
application. 



Experimental Section 
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In this work, we constructed a fosmid library in E. coli from DNA prepared directly 
from soil. We screened the library for presence of biosynthetic pathways. We developed 
genetic tools for functional genomics that allow gene inactivation and horizontal gene 
transfer from E. coli to Streptomyces. 

5 ' 

The cloning vectors in the library contain ColEl replicon for propagation in E. coli. 
Transposable elements based on Tn 5 transposon were produced and used for in vitro (or 
in vivo) modification of selected cloning vectors. Integrated transposable elements 
contain gene for resistance to apramicyne. Conjugative derivatives were constructed by 

10 incorporating origin of transfer from plasmid RP4. A conjugative and site specific 
integrative transposon was also constructed comprising the integrase gene from <|> C31 
phage, including attP attachement site. Conjugal transfer was demonstrated from an 
appropriate E. coli donor cell to another E. coli or Streptomyces lividans recipient cell. 
Constructed transposon was tested for inactivation of the genes cloned into fosmids. 

15 Obtained mutants can be used for direct sequencing by adequate primers and transferred 
by conjugation into Streptomyces lividans. Transposable elements thus represent very 
useful tools for functional analysis of a large DNA libraries cloned into BAC, PAC 
fosmids or other cloning vectors in which cloned inserts must be transferred into 
heterologous host 

20 Materials and methods 

Bacterial strains, plasmids and growth conditions. 

E. coli DH10B (F- mcrA delta(mrr-hsdRMS-mcrBC) phi80dlacZ deltaM15 delta lacX74 
deoR recAl endAl araD139 delta (ara, leu)7697 galU galK lambda- rpsL nupG), strain 
(Epicentre) was used for fosmid and plasmid transformation and DNA amplification. 
25 Unless specifically described, all DNA manipulations were performed according to 
Sambrook, J., et al, (1989). 
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Soil DNA extraction and DNA libraries construction. 

Total bacterial community DNA was extracted and large DNA libraries have been 
constructed into fosmids according to the method described in WO 01/8 1 357. 

Fosmid DNA extraction and Purification 

5 

Fosmids DNA containing soil librairy were extracted from pools of 96 clones. Culture of 
recombinant clones were performed in Deep-Well 96 and 48, respectively in 1 ml and 2 
ml of LB media containing 12,5 ng ml" 1 of chloramphenicol. Cultures were grown at 
37°C with shaking at 250 RPM during 22 hours. DNA extraction was done by using the 
10 Nucleobond PCI 00 extraction kit (Macherey Nagel). 

PCR screening for the detection of PKS genes 
Primers design 

Degenerate PCR primers sets were designed to specifically amplify PKS nucleic 
15 sequences. Multiple sequence alignment of PKS domains revealed highly conserved 
motives, in particular in the p-keto acyl synthase domain. Primers Lib I F and Lib2R 
were designed in conserved motives of the p-keto acyl synthase gene. LiblF (sense 
primer, 5*- GGSCCSKCSSTSDCSRTSGAYACSGC -3 4 ) and Lib2R (antisense primer, 
5'- GCBBSSRYYTCDATSGGRTCSCC -3*) were deduced from P-keto acyl synthase 
20 peptide sequences GP(AS)(LV)(AST)(IV)DTAC and GDPffi(TVA)(RAQ)A, 
respectively. The specific fragment amplified with LiblF / Lib2R was approximately 
about 465 bp (corresponding to 155 amino acids). Specificity and efficiency of the PCR 
systems were validated by testing on positive DNA controls (i.e. genomic DNA from 
type I PKS producing strain such as Bacillus subtilis, Streptomyces lividans, 
25 Streptomyces ambofaciens and Ralstonia solanocearum) and negative DNA controls 
(genomic DNA from strains which are known to do not contain PKS genes). 
Furthermore, DNA extracted from soil samples were tested to calibrate PCR techniques. 



PCR conditions 
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PGR conditions were optimised, in particular for concentrations of DMSO, MgCl^ 
Primers and DNA quantities. For PCR using microorganism genomic DNA and soil 
DNA as template (50 to 200 ng), the PCR mix (50pl) contained 250pM of dNTP, 5 mM 
MgCl 2 final, 2,5% DMSO, IX PCR buffer, 0,75 juM of each primer and 2,5 U of Taq 
5 DNA polymerase (Sigma) and sterile distilled water. For PCR using fbsmid pooled DNA 
as template (100 to 500 ng), the PCR mix (50|ud) contained 250|iM of dNTP, 5 mM 
MgCl 2 final, 5% DMSO, IX PCR buffer, 0,75 nM of each primer and 2,5 U of Taq 
DNA polymerase (Sigma) and sterile distilled water. For identification of positives 
clones in 96 microtiter plates, 25 fil of each bacteria culture were used as template and 

10 PCR conditions were the same as above. Thermocycling program was : a denaturation 
step at 96°C for 5 minutes; then 1 minute at 96°C, 65°C for 1 minute, 72°C for 1 minute. 
The first 7 cycles, the annealing temperature was lowered 1°C per cycle until 58°C was 
reached, A subsequent 40 cycles were carried out with the annealing temperature at 
58°C. A final extension step was at 72°C for 7 minutes. For identification of positives 

15 clones in 96 microtiter plates, the first denaturation step of 96°C was during 8 minutes. 
The other steps were the same as described above. PCR reactions were performed with a 
PTC 200 thermocycler (MJ Research). 

PCR products analysis 

20 PCR products of about 465 bp were purified on agarose gel with gel extraction Kit 
(Qiagen) according to the manufacturer recommendations. First approach consisted in 
subcloning PCR products using the Topo PCR II kit (Invitrogen). Recombinant Plasmids 
were extracted using QIAprep plasmid extraction Kit (Qiagen) and sequenced with 
Forward and Reverse M13 primers with CEQ 2000 automated sequencer (Beckman 

25 Coulter). Second approach consisted in direct sequencing of PCR products. Sequencing 
data were compared with nucleic and proteic genbank database using BLAST program. 

Sequencing of the identified fosmid insert DNA and sequence analysis 
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Fosmids inserts were sequenced using either a transposon-mediated and by shotgun 
subcloning approach. Transposition was realized by using (Transposition Kit) 
commercialized by Epicentre according to the manufacturer. For shotgun subcloning, 
transformants were grown for 16 hours at 37°C. Fosmid extraction was done by using 
5 the Nucleobond PCI 00 extraction kit (Macherey Nagel). DNA was partially restricted 
with Sau3A and sized on standard gel elecrtphoresis for fragments ranging from 1 to 3 
Kbs and cloned into Bluescript vector according to Sambrook et a/. (1 989). 
Sequence analysis was performed with the identification of ORFs by using Frameplot of 
the GC3. Each identified ORF was compared to gene databases by using BLAST 
10 program. PKS domains were determined by aligning obtained sequence versus already 
described PKS domains from domain databases. 

Sequencing 

Sequencing reactions were performed with 1 jig of DNA and 3.6 pmol of primer, using 
CEQ 2000 Dye Terminator Cycle Sequencing kit (Beckman Coulter) under conditions 
15 proposed by supplier. Ten jiL of reaction products were precipitated using 4 yL, of 
solution containing 1.5 M NaOAc, 50 mM EDTA and 60 jliL cold 95% ethanol/dH 2 0 
from -20 °C. The pellet was washed 2 times with 200 yxL 70% ethanol/dH 2 0, vacuum 
dried and dissolved in 40 jliL sample loading solution (supplied in kit). Sequencing 
reactions were run on an CEQ 2000 sequencer (Beckman Coulter). 

20 

Plasmids construction and validation 

Plasmid pPl was constructed as follows. A 941 bp DNA fragment containing native 
promoter region and AA(3)IV gene was amplified by polymerase chain reaction (PCR) 
using primers AmF ( d-CCCTAAGATCTGGTTCATGTGCAGCTCCATC, SEQ ID 
25 NO: 9) and AmR ( d-TAGTACCCGGGGATCCAACGTCATCTCGTTCTCC, SEQ ID 
NO: 10). One hundred microlitre reaction were performed containing 0.1 each of 
primers, 1 X Vent DNA polymerase buffer (NEB), 0.2 pM of each deoxyribonucleoside 
triphosphate (dNTP), 50 ng of the DNA template and 2U of Vent DNA polymerase 
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(NEB). PCR mixture was heated for 4 min at 94 °C in a PTC-200 thermocycler (Peltier) 
and cycled 25 x at 94 °C for 60 sec, 59 °C for 30 sec and at 72 °C for 70 sec. The final 
extension was performed at 72 °C for 7 min. 

PCR product was purified using GFX DNA purification kit (Amersham), then digested 
5 by Bgl II and Sma I restriction enzymes. A 941 bp Bglll/Smal fragment was inserted into 
the Bam HI, Sma I sites of pMOD plasmid (EPICENTRE). DH10B E. coli was 
transformed with pPl and subjected to apramicyne selection on LB agar plates. Six 
colonies surviving on apramicyne selection were grown in liquid LB media and final 
pPLl candidates were thoroughly checked via PCR and restriction mapping. 

10 • ' • 

To construct conjugative plasmid pPLl, 750 bp oriT DNA region from plasmid RP4 
were amplified via PCR using primers oriTF (d- 
GCGGTAGATCTGTGATGTACTTCACCAGCTCC, SEQ ID NO: 11) and oriTR 
(TAGTACCCGGGGATCCGACGGATCTTTTCCGCTGCAT, SEQ ID NO: 12). PCR 

15 conditions were as above. Amplified DNA was digested using Bgl II and Smal 
restriction enzymes. BglWSmal DNA fragment was subjected to purifiation after gel 
electophoresis on 0.7% agarose. Purified fragment was ligated into pPl plasmid digested 
by Bam HI and Sma I restriction enzymes. 

20 <|> C31 integrase gene and attachment site (attP) was amplified via PCR using primers 
Fint (d-AACAAAGATCTCCCGTACTGACGGACACACCG, SEQ ID NO: 13) and PJ 
(d-CGGGTGTCTCGCATCGCCGCT, SEQ ID NO: 14). Amplified DNA fragment was 
purified by GFX kit (Amersham) and phosphorilated using T4 polynucleotide kinase 
(NEB) under conditions recommended by the enzyme manufacturer. Phosphorilated 

25 DNA fragment was cloned in to pPLl vector opened with Smal restriction enzyme 
(NEB) and dephosphorilated by calf alkaline phosphatase (NEB). DH10B E coli was 

transformed with ligation mixture using Bio Rad "Pulsing" apparatus and protocols- 
provided by Bio-Rad. Twelve transformants were analyzed by PCR for the presence of 
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integrase gene. Orientation of integrase gene was verified by restriction analysis using 
BglU and EcoRl restriction enzymes. Resulting plasmid was named pPAOI6. , 
To construct pPAOI6-A plasmid, pPAOI6 plasmid was digested with EcoTtl and BglK 
restriction enzymes followed bydigestion by Bean mung nuclease (NEB). Linearised 
5 plasmid was selfligated and transformed in to DH1 OB cells. 

Plasmid preparation of fosmid DNA 

Fosmid and BAC DNA for sequencing was prepared by using the Nucleobond AX kit 
(Macherey-Nagel), following protocol for B ACs, Cosmid as specified by manufacturer. 

Mutagenesis 

10 Transposon Tn-pPAOI6 was prepared by digestion of pPAOI6 plasmid using PvuIL 
restriction enzymes, followed separation on agarose gel and purification of fragment 
containing transposon from gel using Qiagen kit The same molar ratio of transposon 
and corresponding fosmid was used for mutagenesis in vitro using Tn5 transposase 
(Epicentre) and conditions specified by manufacturer. We transformed aliquot of the 

15 transposed mixture by electroporation into competent DH10B E. coli strain. 

Conjugation E. coli- Streptomyces lividans TK24 

Conjugation experiments were done using 6 x 10 6 E coli S17.1 cells containing 
conjugative plasmids or fosmids. The E coli cells were grown in LB media with adequat 
antibiotic. The cells were collected by centrifugation, washed two times using same 

20 volume of LB media and concentrated to 10 8 cells/ml and overlaid oh LB plates 
containing 2x 10 s pasganiiiated Streptomyces lividans TK24 spores. The cells mixture 
were grown over night at 30° C and E. coli cells were washed three times using 2 ml of 
LB media. The plates were overlayed using top agar containing NAL (nalidixic acid) and 
the appropriated antibiotic. Plates were incubated for 4 days at 30°C and transformant 

25 streptomyces colonies were isolated on HT medium (Pridham et al 1957) containing 
NAL and the same appropriated antibiotic. 
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Results 

Construction of a Transposon Tn <Apra>. 

E. coli aminoglycoside -(3)- acetyl transferase IV gene (aa(3)IV) was amplified by PCR 
and cloned in pMOD vector (Epicentre) . Advantage of this selective marker allow 
5 positive selection in E. coli and in Streptomyces lividans. Transposon can be used for 
insertional inactivation in vitro using purified transposase Tn5 (Epicentre). The structure 
of the pPl constructed vector and transposon was shown on Figure la,lb. 

Construction of a conjugative plasmid-transposon. 

Conjugative vector, transposon was constructed by cloning origin of transfer from 
10 plasmid RP4 into pPl vector producing pPLl vector (Figure 2a, 2b). The origin of 
transfer was cloned in such orientation that the selective aa(3)IV gene is the last 
transferred during conjugation. PPLl vector was introduced in to specific E. coli SI 7.1 
strain that carry RP4 plasmid integrated in to chromosome. In conjugation experiment 
between donor strain S17.1 carrying pPLl plasmid and DH10 E coli receptor we 
15 obtained DH10B strain carrying pPLl plasmid. This data shows that cloned oriT 
fragment is functional in the pPLl plasmid. Plasmid pPLl can be used for DNA cloning, 
gene inactivation by homologous recombination. Cloned genes or part of the gene 
cloned could be then transferred by conjugation in to another host. Another advantage of 
this vector is conjugative transposon that can be excised from vector and inserted 
20 randomly in vivo in to another DNA molecule by purified Tn 5 transposase. 

Construction of a conjugative, site specific integrative plasmid- transposon for horizontal 
gene transfer between E. coli and Streptomyces strains. 

PPLl plasmid was used to clone an integrase gene from phage <|>C31, resulting a plasmid 
pPAOI6 (Figure 3a, 3b), We tested several clones for horizontal transfer between E. coli 
25 SlT.l straE "and Streptomyces lividans TK24 strain. The best transfer was obtained for 
plasmid pPAOI6 were orientation of integrase gene is in opposite orientation to the gene 
for resistance to apramycine. Conjugative transfer of pPAOI6 gene in to S. lividans 
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strain is confirmed by resistance to apramycine or G418. Additional confirmation of 
transfer was obtained using PCR method. We were able to amplify 2 kb insert using 
specific primers for the <J»C31 integrase gene, and no PCR amplification was obtained for 
control S. lividans TK24 strain. 
5 The goal of this construction was to produce transposon that is further used in functional 
analysis of the metagenomic DNA library from the soil ttiat was constructed in a 
laboratory (Figures 4a, 4b, 4c). 

Functional analysis of the metagenomic DNA library from soil. 

The fosmids library consists of 120 512 clones, containing ~ 40 kb inserts of soil DNA. 

10 The library contains approximately 4.8 Gbps of die DNA cloned from soil. Ten percents 
of the library was screened by using a PCR approach for the presence of the genes 
involved in production of secondary metabolites (PKS). Using gene-module specific set 
of primers we were able to identified positive clones organized in microtiter plates (96 
wells). Sequences (based on PCR products) obtained from fifteen randomly positive 

1 5 clones indicate that the DNA library contains very little sequence redundancy limited to 
one and that the sequences were found to be new and very diverse in comparison to gene 
databases (data not schown). 

Two fosmids DNA was prepared from two positive clones (FS3-124 and FS3-135) and 
analyzed by sequencing. DNA analysis in silico shows high G+C contents of 72% and 

20 69% respectively of the cloned inserts and presence of cluster genes that could be 
involved in biosynthesis of secondary metabolites (Fig 5 a and b). No specific 
phenotype was observed for the two clones in E coli. We employed pPAOI6 transposon 
mutagenesis to produce conjugative mutants. Transposon mutants FS3-124::pPAOI6 and 
FS3-135::pPAOI6 were isolated using apramycine as selective antibiotic. Obtained 

25 mutants were then tested on LB pates containing chloramphenicol. About 1% of the 
tested clones are chloramphenicol sensitive. These clones contain transposon inserted 
into locus encoding chloramphenicol resistance gene and not into cloned DNA insert. 
ApraR and ChloS transposon mutants are then used for horizontal gene transfer into 
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Streptomyces lividans TK24 strain. Fosmid DNA was prepared from mutants and 
transformed into E. coli S17.1 strain. Horizontal gene transfer between K coli S17.1 and 
Streptomyces lividans was done due to inserted pPAOI6 transposon. Transconjugants of 
the Streptomyces lividans were tested by PCR to confirm gene transfer and integration of 
5 the conjugative fosmid in to & lividans chromosome. Both transconjugants showed an 
increase in doubling time, morphological modifications and pigments production in 
comparison to the control (Figure 6). 
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CLAIMS 

1. A method of analysing a library of polynucleotides, said polynucleotides being 
contained in cloning vectors having a particular host range, the method comprising (i) 

5 selecting cloning vectors in the library which contain a polynucleotide having a 
particular characteristic, (ii) modifying said selected cloning vectors to allow a transfer 
of said vectors and/or expression of the polynucleotide which they contain into a 
selected host cell, and (iii) analysing the polynucleotides contained in said modified 
vectors upon transfer of said modified vectors into said selected host cell. 

10 

2. The method of claim 1, wherein the library comprises a plurality of unknown 
polynucleotides. 

3. The method of claim 1 or 2, wherein the library comprises a plurality of 
15 environmental DNA fragments. 

4. The method of any one of the preceding claims, wherein the cloning vectors of the 
library are E. coli cloning vectors, preferably cosmids, P 1 or BAC. 

20 5. The method of any one of the preceding claims, wherein the selected vectors are 
modified by targeted insertion, into the vector, of a target polynucleotide construct 

6. The method of claim 5, wherein the targeted insertion is performed in a region of the 
vector distinct from the polynucleotide. 

25 

7. The method of claim 5 or 6, wherein the target polynucleotide construct comprises an 
origin of transfer functional in the selected host cell. 



36 



8. The method of claim 7, wherein the origin of transfer is functional in Streptomyces 
host cells, 

9. The method of claim 8, wherein the origin of transfer is selected from RP4, pTiC58, 
5 F, RSF1010, ColEl and R6K(ct). 

10. The method of any one of claims 5 to 9, wherein the target polynucleotide construct 
comprises an integrase functional in the selected host cell. 

10 11. The method of claim 1 0, wherein the integrase is selected from <(>C3 1 . 

12. The method of any one of claims 5 to 1 1, wherein the target polynucleotide construct 
comprises a transcriptional promoter functional in the selected host cell. 

15 13. The method of any one of claims 5 to 12, wherein the target polynucleotide construct 
is contained in a transposable nucleic acid construct 

14. The method of claim 13, wherein the transposable nucleic acid comprises, flanked by 
two inverted repeats, tiie target polynucleotide construct and a marker gene. 

20 

15. The method of any one of the preceding claims, wherein the cloning vector 
comprises a first marker gene and wherein, in step ii), the cloning vector is modified by: 

. contacting in vitro, in the presence of a transposase, the selected cloning vectors with a 
transposon comprising, flanked by two inverted repeats, the target polynucleotide 
25 construct and a second marker gene distinct from the first marker gene, and 

. selecting the cloning vectors which have acquired the second marker gene and which 
have lost the fik^Tm^kCT gene. ~ 
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16. The method of any one of the preceding claims, wherein, in step (i), the cloning 
vectors are selected by molecular screening. 

17. The method of any one of the preceding claims, wherein, in step (iii), the modified 
5 cloning vectors are transferred into the selected host cell by conjugative transfer. 

18. The method of any one of the preceding claims, wherein, in step (iii), 
polynucleotides are analysed by determining the phenotype or properties of the host cell 
upon transfer or expression of the modified vector. 

10 

19. A method for the identification or cloning of polynucleotides encoding a selected 
phenotype, the method comprising (i) cloning environmental DNA fragments into E.coli 
cloning vectors to produce a metagenomic library, (ii) identifying or selecting cloning 
vectors in said library which contain DNA fragments having a particular characteristic of 

15 interest, (iii) modifying the identified or selected cloning vectors into shutde or 
expression vectors for transfer or expression in a selected host cell, (iv) transferring or 
expressing the modified cloning vectors into said selected host cell and (v) identifying or 
cloning the DNA fragments contained in said modified cloning vectors which encode 
said selected phenotype in said selected host cell. 

20 

20. A transposable nucleic acid construct, wherein said construct comprises an origin of 
transfer flanked by two inverted repeats. 

21. A library of polynucleotides, wherein said library comprises a plurality of 
25 environmental DNA fragments cloned into cloning vectors, wherein said environmental 

DNA fragments contain a common molecular characteristic and wherein said cloning 
vectors are E. coli cloning vectors comprising a target polynucleotide construct allowing 
transfer or expression of the environmental DNA into a selected host cell distinct from 
E. coli. 
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22. A polynucleotide sequence comprising all or part of SEQ ID NOs: 1 or 2, or of their 
complementary strand, 

5 23. An oligonucleotide comprising SEQ ID NO: 3 or 4. 



ABSTRACT 



The present invention relates to methods for the identification or cloning of 
polynucleotides encoding a selected phenotype, particularly from environmental DNA. 
In a specific embodiment, the method comprises (i) cloning environmental DNA 
fragments into E.coli cloning vectors to produce a metagenomic library, (ii) identifying 
or selecting cloning vectors in said library which contain DNA fragments having a 
particular characteristic of interest, (iii) modifying the identified or selected cloning 
vectors into shuttle or expression vectors for transfer or expression in a selected host 
cell, (iv) transferring or expressing the modified cloning vectors into said selected host 
cell and (v) identifying or cloning the DNA fragments contained in said modified 
cloning vectors which encode said selected phenotype in said selected host cell. 
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CDS 76.. 1134 

/note= «ABC_t ransportr ■ 

/gene="TAP2 PROTEIN" 

/blastp_match="Oryzias latipes" 

/blast_score= 0.002 
CTS complement (1096. .2430) 

/not e= n none " 

/blastp_mat ch= " Anabaena sp" 
/gene="AIiR1117 PROTEIN" 
/blast_score=2e-l8 

CDS 1178.. 1624 

/not e= «Graraj?os_ancnor " 
/gene="CEI*Ii WAliI* SURFACE ANCHOR" 
/blast score=ie-04 

/blast£_match==«Streptococcus pneumoniae 
complement (2506. .3567) 

/not e= " CONSERVED " . . r 

/gene= « HYPOTHETICAL PROTEIN" 
/blast score=0.019 

/blastp_match="Deinococcus radiodurans 
complement (2906. .4222) 
/note=«glycosyl transferase" 
/gene* " lipopolysaccharide" 
/blast_score=2e-23 
complement (4092 . . 5321) 
/note="glycosyl transferase" 
/gene«"glycosyl transferase" 
/bias t_score= le - 1 5 
complement ( 63 37 .. 8502 ) 
/not e= " PUTATIVE " 

/aene="GLUTAMlNE AMIDOTRANSFERASE ' 
/blastp^match-" Bordetella bronchxseptica" 
/blast_score=le-l6 
complement (8181. .9530) 
/note- "none" 

/ gene = " MEMBRANE PROTEIN" 
/blast__score-0 . 035 
complement (9531 . . 10721) 
/note="NOEC Transmembrane" 
/gene=s"NODULATION PROTEIN" 
/blastp_match«"Azorhizobium caulxnodans" 

/blast_score=3e-07 
complement (10504. .11274) 

/note* * PUTATIVE " 
/qene= "HYDROIiASE " 

/blastp_match»"Streptomyces coelicolor 
/blast_score=4 e - 14 
CDS 12874 .713689 

038 /note= "HYPOTHETICAL Meth-transf " 

/gene=" PROTEIN PA1088" 
/blast score=2e-06 

/blastp_match=" Pseudomonas aeruginosa 
CDS 14195.. 15976 

COb /note=«PUTATIVE / Glyco_transf» 

/note= " " 

/gene=»IiIPOPOI.ySACCHARIDB BIOSYNTHESIS" 
/blast_score=2e-0S 

/blastp^«atch= "Vibrio cholerae" 

SS^pathLy, inner core lipopolysaccharide 
biosynthesis" 

/gene="PHOSPHOHEPTOSE ISOMERASE 
/blast score=3e-17 

/blast£_match= "Helicobacter pylon" 
CTS 15579.. 16253 
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/notes "none" 11/35 
/genes" PHOSPHOHEPTOSE ISOMERASE" 
/blast score«2e-22 

/blastp_matcbs"Neisseria meningitidis 11 
cds complement (16505 . . 17656) 

/not e= "BIOSYNTHESIS PUTATIVE 0 
/gene="LIPOP0LYSACCHARIDE" 
/blast scores2e-17 

/blastp_matchs«Thermotoga mar it una" 

/pf am_mat ch« "Glycos _trans£ _l ■ 
CDS complement (17657.. 18697) 

/notes "none " 

/gene="AI*R3073 PROTEIN™ 

/blast_s core= 6e - 2 7 

/blastp_match= " Anabaena sp" 
CPS complement (18615.. 19304) 

/note= n none n 

/gene="ALR4487 PROTEIN" 

/blast_score-8e-07 

/bias tp_match= "Anabaena sp" 
CDS complement (19301 -.20596) 

/note= " ATP _GTP_A" 

/genes "ABC TRANSPORTER" 

/blast_score=3e-61 

/blastp_match="Synechocystxs sp" 
CDS complement (.20535. .21476) 

/note= "PERMEASE COMPONENT" 

/gene= "POIiYSACCHARIDE ABC TRANSPORTER" 

/blast_score=6e-4l 

/blastp_matchs«Clostridxum acetobutylxcum" 

%Se^ 2 l-d 5 iA 2 S X U b esia of a polysaccharide 
capsule ?" 

/gene="32.3 KDA PROTEIN" 
/blast_scor e=3 e - 17 
/blastp_matclia"Sphingomonas sp" 

CDS 23155.. 26523 

/note="peptide syntase" 
/gene="mcyA, mcyB and mcyC" 
/blastp_matcn="Microcystis aeruginosa" 

/blast_scores0 . 0 
CDS 26409.. 34433 

/notes "polyketide syntase et peptxde syntase" 
/genes "mcyD, mcyE, mcyF and mcyG^ 
/blastp_match="Microcystis aeruginosa" 

CDS 34418.. 37500 

/notes n CYSTATIN" 

/genes "PEPTIDE SYNTHETASE" 

/blast_scoresO . 0 

/blastp_match= "Anabaena sp" 
CDS 35359.. 37500 

/note= "gene cluster" 

/gene= "nostopeptolide biosyntnetic" 

/blast_scores2e- 4 1 

/blastp_matchs"Nostoc sp" 
Sequence 37500 BP; 6199 A; 12698 C; 12769 G; 5834 T; 0 other; 

gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc 60 

SfttSfltS Igtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 120 

aat^Stl SactcLta tagggcgaat tcgagctcgg tacccgggga tcccacgtac 180 

cacaSctca tctggaagag cccgctccag acacccgagg aatggaagcg cggcgtcgtg 240 

aaaccSaca cgca|aagcj tctcgtcgcc ttcgttgact ggcttgcgac gacgatgaag 300 

SStca ccaglat|t? cgccjcalgc agctcgatgg gcggatcggg cgcgatcatg 360 

ctcggattc gctScccgc algattcgcg tggaccgtgt cgtgggtcgg cgtccacgtg 420 

ccclccgact ctccgcaatt cacatcgtcg tacgagctgg tgtacggccg ^cccgacttg 480 

ZZl^^ZZZ fcaaaaacoq cacgccggtc tgggatcatt ttagtgacgt ctggtacctg 540 

cScalcaS cgSKSa catcgggtc a?cacgttct cgaacggcaa gaacgactcg £0 

aSatcSct Scgccallc cgtcjalttc ctgaagacgc tgcaggagac caggcaaccg 660 

cacctg??cg tctfgggcca glaglgacac ggccagcgcg cgaagatgcc ggaaggtggc 720 



ggcgaacggg 
tgcacgctcg 
atcaactcct 
gtcgtcatca 
cggcggctgc 
ggcaacaccg 
ccgcagatgc 
ggcgtcctcg 
gtccgcgcgc 
agccgcccgg 
aagtccccct 
cactttgttc 
gagatactcg 
tgcccgcacc 
gcgaaccacg 
gtagccgaac 
aaacatctcg 
ttcgagcagc 
gactttctcg 
cgccgccgtc 
gaaggggatc 
gagcaccgcg 
cgaatgcagc 
gagattccgg 
cgctcgatgc 
fcegtgcgagc 
gggcagccgg 
acgcgggccg 
cccgccgcaa 
caggtgccgg 
gtgaggtcgc 
aggacatcgc 
tcgtcgagct 
tcgcagtgtg 
gggatgagcg 
cgccgcacgg 
agtcccttgc 
gggctccgta 
cgaacgataa 
cgggattggc 
tgaccagggc 
cgatgggtgt 
acgactgcac 
ggaaacgcca 
ggctgccgtc 
aggcttcgat 
cgaagtcgct 
gtcggaaagc 
ctttggccag 
cgagaatccg 
atccatgggc 
gatcggtctt 
ggagcttcga 
tgccggccgc 
agcagacggt 
tctccggtcc 
cgattcccgg 
gcccgggcga 
actgcgccgc 
agcaggagcg 
ggaatcgaga 
atggcgagcc 
aaaaacacgg 
gacaggatgc 
tcgaggctgg 
ccgctcggga 
gcgcgctgca 
cccggggtct 
gcgtcaacga 



agatgccgct 
acgacgatcc 
acgtcacctg 
agctggcgaa 
agcagttcaa 
tcgtgcagcg 
aggtctcaac 
acgagccggc 
gcggcgatgt 
aacgcctcga 
gcgacggccg 
tgcacgcctc 
cgtacctccg 
gcggcggtcg 
gacggccaga 
acgccgcaga 
acgtcgatgc 
aggcgctcgc 
aagcggagca 
gacgccagcg 
ccgcccagga 
tcgggccgcg 
aggacgtgcg 
ccgcggcgga 
gaccactcct 
gtgtgaatga 
tgtgtgagca 
gtacgcccgc 
tcacggaatc 
cgccgatccg 
tcgtgccgtg 
gctccaggtg 
tcgcgccggt 
caagcacgcg 
agagccactg 
agtaggagag 
cggcgtcctt 
tcgtgtgacg 
ctcttcttcg 
catcgcctgc 
gtgcgtgccg 
ctcgaacgcc 
gagcaggtcc 
ggttccgggc 
gccggcgatg 
gagcaagtcg 
ggcctcgaat 
atggtggtcg 
gagctcccgc 
gcgatcgagc 
cgtggccagc 
gtactcgtgc 
ccagattccg 
tttcgcccgt 
gatcgcgtac 
gccgcccgtg 
cctaggcgcg 
tatcgtacgt 
gccggcgccg 
cggtaccgac 
cgagcgccag 
ggtgacggaa 
cgcgaagcga 
ggccgccgcc 
tatcgaagcc 
gccagtccac 
cccagttgcc 
ggagcagatc 
acgccagcac 



cgacctcaga 
gggagacggg 
gcagcctgac 
tcgaccgccg 
gctcaagccc 
cggcgaagcg 
gggtggaaac 
gcatcggcgc 
cccgtcgcgc 
ctgattccgc 
gggtcacgac 
tcgcgatcgc 
ggaccgcgcc 
ggtccatgcc 
cctctcgcgc 
agacgacccg 
cgttcggcac 
gctcgctcac 
agcggccggc 
cgcgccattt 
tcggctcgca 
tcgcggcaat 
tgaggggtcg 
cgcgcacgac 
cgtcgtcgtg 
gatgaaacgc 
gaaggactct 
ggcgatcacg 
cggaccgacg 
caccatccgc 
cgtgtccggt 
gacccagccg 
ctgcgaaaag 
gccgaggaaa 
cgtcgaccct 
caaggcgggc 
gaggctcacg 
agcgagtcgt 
atccggcgtc 
tcgatcgcgt 
ttcgacacga 
atggcttcga 
agcccgtgat 
ccgagctcgc 
acgagacgga 
aatcgtttct 
cccagtcctt 
atgccattcg 
cgaatctgfct 
gggtagtaga 
ggaatcacgc 
gcgtgcacga 
gtgtcgaacg 
tcgtccatgc 
ttggagcgat 
ccccgcacgc 
gcgtctggca 
gccggccacg 
gatcatcgcg 
aatccaccac 
cagatcgagg 
gctcacgcgc 
tcggggatcg 
ccggcgcagc 
accggcgcgg 
gtcttcggtg 
gtcaccggga 
gacggcgacc 
gtcgccctcc 



acgaaccaga gcctgccggc 
tcggattcga gcggcgtccc 
tcgatcgtcg acgaacccgg 
cggtcgcccg ccgcagtcga 
ggcgatcagg tgacatggac 
gtcgcggacc ggtggggtct 
aggatcctgg tgagcaggaa 
cagttgatgt gcccacgaaa 
ctgcggcggc tttgccagca 
cacggagcac gccggccgtg 
gttcgggagc cccgcggcaa 
aatcggcgcg accgcgatcg 
ggtcacttca atcgtgggat 
cacgagcatc agccgcgcgt 
gaaccagagc gccgcgcgct 
cgcttcggcg gccggtggtc 
gaccgacgcc gtgaggccgc 
gaccgtggtc gccgccgcgc 
ctcgcggcgg aacagccagc 
ctcggagtcg acgtccacca 
ggcgtagcgg gccatcccgc 
ccgtcgttcg agcacctggt 
atcgcccggc agcgccagac 
gtcggtcgac gccgtcacgc 
cgcgagcgag acgaggtgca 
gcgaatgcga tcgccccggt 
caccgtgcgg catccgatcg 
cgggcgggaa tcggcgcggt 
tcagccatga ccacggcggc 
ggtgtgcctg gctgctcacg 
ccgctgggga tgtggacgcc 
agatggcatc cgggtcccac 
atggtgccga aggcgaccgt 
gccgcgcgaa ggtattgtcc 
tcgagggccc ggtcggcgcc 
aggacgagca ccagcgcgat 
cccactgcag cgcgcgcccc 
agagcgcttc cactttgcgc 
gcgccgcaat cgcacgggtg 
ccgcgaggcc tgacgcccgg 
gctcggccgt gcccccggcc 
gcacggcgtt gggcgtccct 
ggacggccga gacgtcggac 
gggcggcctg ttcctcgagc 
ggcgcggcca cttcgcctga 
gcggctcgag acggccaacc 
cacgcgccgc ggcctcagat 
ggatcgtgac gacccgatcc 
cggacacggc aatgacggca 
cggtgcgttc gcgaaacgaa 
gctcggcgcg tcccaggagc 
tgtcgatgcg cagatctttc 
aatgcttctc ccggacctcg 
cgaacaccgg atcgcgctgg 
cggcctgcgc cgttccgagc 
tgcggagctc gagcacgcgc 
gcatggcccc cccggccgac 
acgagcgact gtgcgatctc 
gctgcgcgca agccggcgaa 
ccgccgcgct gccacagcgc 
accgggatga tcacgctggg 
-aggttgfccgc ggccgcgcca 
ccctggtgca cgctgcggag 
cgctggcaga tatccacgtc 
tcgaaggcct cgcgcttcac 
ctctgcccac gacggcgaaa 
gcttcgtact gcgcgccggc 
accggccagc gcgggtcgat 
gccgaccggg ccgcgcggtt 



gttcagccgt 
ggccgggcag 
ccgatggagc 
cgtcacgccg 
gaactccgcg 
cgtcacgctc 
ctaggcgggc 
gctggctgag 
aatccagcac 
cgatggtggg 
tcgcttcgag 
ccgatcgcca 
cggccgcgag 
ccgggcgggc 
cgttgggctc 
ctttcggcgc 
ccgtgacttc 
gccgcatcgc 
ctttcggcca 
tgtccaggac 
tgcagtacgc 
agatctccgg 
cgctcgcgat 
cggcgagatc 
cgtcggcgga 
tcggggcgta 
ctctcggatg 
cacgaccgac 
gctgccgatc 
gatcggccgc 
ggcgccgacc 
gtacacatgc 
gaccgacggg 
gatgacgccg 
gaggaagggc 
gccacgggcc 
gcgtcggacc 
atccgccgct 
cgtccggccg 
ccgggctgaa 
gcggtcgcga 
tcgtagtccg 
gtgtggccga 
ggacgccgga 
agggcggcac 
gcgccgatga 
cgtggatcgc 
ggccgtccgc 
gggaagcggg 
tggccggtcc 
agggcgagga 
acgagcgcgc 
acgtagtcga 
tcgcggatgt 
agaatcgtct 
acgggacgcg 
gagcgcgatg 
gacgagtccc 
accggcaagc 
gaggcccagc 
cagatcccgc 
..cagetcgccg-. 
ccgcgggtcc 
ttcacaggtc 
ggcaagattg 
cgagtcgtac 
agcgacgacc 
ctcatgatcg 
gcgaagcgtc 
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840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 

___444l0„_ 

4500 
4560 
4620 
4680 
4740 
4800 
4860 



gccacgttca 
gagtcgtcgg 
gtgcgatgaa 
acgaacgaca 
gctttcgtag 
cccctgctct 
cgatcgcctc 
actctgcggc 
ggatctgccg 
cgatcgtcca 
cggtggacgc 
tgttccgcac 
tgatccccgc 
cgatgagcgt 
ccgattgcgg 
ccaggatgtc 
agcacgccac 
attgctgtcc 
cgccgcgcac 
cgtcagtggc 
gaatgacgac 
tgctggccat 
tgacgcgttc 
tcgccggcgc 
gaagcgccac 
gtgttcgcgc 
tctcccgtga 
cgtgcgcgcc 
ccagcgatcg 
gttgaccttg 
cgagttccgc 
cgtgtcgtcg 
gaacggcagc 
ccgatgatgc 
gtcgaggagc 
tgtgaagagc 
gtcgatgaac 
cgccaggctc 
gtcccgcagg 
ggtgagcgag 
ctcgaagaag 
ggcgcagccg 
cagcacggcg 
gagcgacttc 
cgtcgtacgc 
gcctgacggc 
cagggtcttg 
gagagccggt 
tccggccgcg 
cactcgctct 
catggcaggc 
cgaaccaaac 
cggccggctg 
gaccgcccat 
cgccgctgac 
gacgccgagg 
ggtgtcgtct 
gcaatcgaca 
ccgcccacgc 
gccagggccg 
aggagagtga 
cacgcgctgt 
taggcggccg 
agcgcgaccc 
tcgctcgcgt 
tccgtcatca 
ttgcgcttcg 
cgggagtacg 
ttggcccatt 



cgtcgggcag 
tcgacccgtt 
tcgacgccag 
cgcgcggcgc 
cgcggcccgg 
cggccgccga 
cctggcggct 
ggcgttcgtg 
gctcgtcagg 
cggcgtcgcc 
gttggcttcg 
gatgccgcgc 
ggccaggccg 
gtcctgcgag 
gccgcgaatc 
cacgccgtcg 
gaggccatcg 
cgcatcgatg 
ctgaatcgcg 
gccaaccgag 
gcggtcccga 
ccggtcgagc 
gagcgcctgc 
cgcccatccg 
gagtgcgccg 
aaccagagct 
cgcgtgtcgt 
tccggggcga 
aagtcgtgat 
tcgaacacct 
accttgagca 
cgccatgccg 
cgcacgtcca 
tcacgaagat 
gcgtgcagcg 
gtgtccagcg 
tcgccgcggc 
gtcttggcga 
aaatcgatcg 
accatgcgcg 
cggtggtcgg 
gcgacgccca 
cggctgtcga 
tcgaccgcgc 
ggtccgcggt 
cagtcgtacg 
tcgcccagga 
ttgcgcgccg 
tcgacccgcc 
cgcgccggat 
cccagccgtt 
tcgctccgga 
acgtcgcgcc 
cggtccgacg 
gcggcgtcgg 
aggccgttca 
gagcggccgg 
agccgaggaa 
agaatgcaat 
gatcccgctt 
gcaggaacgc 
gaaccgagcg 
tgaaggcatt 
tccagaagtg 
cgatgctcga 
cgagcgacat 
agtacaggaa 
tgctgatgcc 
tcctcgcttc 



cgacagcact 
gtccgccacg 
gcaacggctg 
cacggcaatt 
gttgggggcc 
tcgaaatcga 
gccgacaggt 
acgttgttcg 
ttgttgcgga 
gtgccgagcg 
atcgcttcgt 
tggtgafccga 
atggcgatgc 
ttcgcccagg 
cgcaggaaga 
gtgtaatcgg 
tccgcgaagc 
agctggacgt 
tggtacccga 
agcgccacgc 
tcgccggagc 
cggtactcgc 
agcagctcgg 
cagaagtgcg 
gccacgatga 
cgaggatgag 
cgatgatggc 
acagctcgcg 
agtgacggta 
tctccgccag 
gcgccgggtt 
gatcgccgct 
cgacggtgcg 
agagatagct 
agtcggccgc 
gcaggccgct 
ccgtcatcgc 
gctctccgcc 
ccagcatctc 
cctggttcgg 
taccggtgag 
gcgtgtacgt 
ggccgccgga 
gcctgaacgt 
agagatcggc 
cgagcacggc 
cgaagcccag 
aggtcctgag 
agtagatcgg 
cgatcacggc 
cgtacaccaa 
gcgccgcttc 
cggccgccac 
gatctcgata 
cgcggctgaa 
tcagacgacc 
aaccggccgc 
gtgccagagg 
gagggcggtc 
cgagacccgg 
cgcgaggccc 
ccccttgccc 
gtgaccgacg 
cagccggctc 
gtcggacgag 
cgcgacgaag 
atagacgagg 
gcgatagagc 
cgtctgcatc 



cgcgcggatc 
acgatctcgt 
aggtgcgccg 
ttcgcggtca 
gcgagaactc 
gcggcgcctg 
gcagatcgca 
tcgcctccgc 
tgagcccgtt 
tgaagacgct 
cggcccattc 
tctgcttctc 
ccctgaacga 
cgaggatcgc 
cgttgtcgcg 
acggcgcgct 
gtccgtcagt 
tgtgcaccgt 
cgtggccgac 
cgatcgcgtc 
ggctccggag 
cgttctcgag 
acgtccgccg 
gcaccggcgc 
ggcgagtcat 
caggatttga 
gcgcaacgtc 
ctcgacggct 
cccaggcgcg 
cggaccggcg 
gccggcctgc 
cagcaggacc 
gaagagatcg 
gctgagctcc 
gccccgtccg 
gctgatgtag 
gaacacgtgg 
gtggccgcgc 
ggtcagtccg 
cagaaagtcc 
cttcgacagc 
gatgagcccg 
gagcgacagc 
cgtgtgcacc 
cggctcgaaa 
gccggggtcg 
cgtgacgtaa 
cacggaagcc 
atacgagccg 
gagcgaaaac 
agccaggaga 
gttgtagaga 
ctcgtcgagg 
gacgaccgtg 
ggcggtagat 
ccgcgggcag 
gcggcgtgcg 
atctcggtgt 
tggacggcca 
cggaccctgc 
acgaacccaa 
caggcgccgt 
ccgtaaatcg 
tgggccgatc 
ctgatcgagc 
accgccgccc 
ccaagcacgg 
accccgatga 
agcgtgacga 



cgcgctctgc 
accagccagc 
cgccgttctt 
tcgatttcgt 
gaacgcgccc 
cggaatctcg 
cgaggccgcg 
ctcaccgccg 
cgtcacggga 
gttgtgctcg 
gttcaggttg 
gccgtcgcgt 
atcgacgatg 
cgggccggat 
gatgacccac 
cgtcgtgtac 
gccgacgctg 
ggctcgggac 
cgtcagatcc 
gccggtcatg 
cacggtgccg 
cagaatcgtg 
cacggtgacg 
ggcggcctgg 
gccgccaccg 
aggaggtagc 
tcggggcgca 
tcgagcagtc 
ttcaggcgct 
tcgccgcgcg 
gtgatcgcgc 
tgcaggaact 
agcgagggaa 
gcgggtgaga 
gagaggcgcg 
ttggcgcgcg 
ccgtcggtgt 
agcaggacct 
tggctcagat 
ttgaggtacc 
tgttgcgcga 
ccaacaccgt 
ccgaagcgat 
gcgtggacgt 
taccgggaca 
aggagcgtga 
tcggcggccg 
agatccgatc 
aacggatccg 
gcgccgtcga 
tcggcgcgcg 
tcgccgcgaa 
aagccgagcg 
cggcgcggcc 
gccggctcat 
acacgcgcga 
ccgccaccat 
actgaaagat 
gcgcgtacgg 
ggcaggcgat 
gctcagccac 
ccgacgtgtc 
gccggtcatc 
gctgatcgcc 
tcatgcgatc 
cggccacgag 
cgagcgtgag 
gcagcacgcc 
tgagcggcac 



cacggcacgc 
cggcgcctcg 
gacggggatg 
agcgcggccg 
agatccggcg 
atgccggcgt 
tttgcgaacc 
ttacggcgca 
aaccggacgc 
atccggaagt 
cagatcacgt 
gcgtaggccg 
acgttccgtt 
cgccagccgc 
ccccgggccg 
tcgaagcg.cg 
cctttgagca 
gcgccaagct 
gcaatcgtca 
ccgccgccgc 
gggacgttga 
gtgcgcggct 
acctgtttcg 
ccggacggca 
tgtcgacgtc 
cgcgcgactg 
ggaacatccg 
gcgtctgcag 
tcagcagggt 
cgccggtgtt 
gatgaagcgc 
cgcgatccat 
tcgtgaagcg 
gctccacggt 
cagcgtctgc 
acgtcatgta 
ggagcggcca 
cgataccgga 
acatgccatc 
ggtcgtcgag 
tcacctgatc 
tcaccgcact 
gcctgcccga 
acgcgtcctt 
cgctcgcggc 
cgcccgccgc 
cccggaggct 
cgaacgacag 
ttgcaagcag 
gccgggcggc 
tcccgttcgc 
agatcaccag 
tgccgagccc 
tgcccgactt 
cgcggcaaac 
cggaacaaga 
ccggttgagc 
gacgaacgtt 
cgccagctcg 
gagcgccagg 
gagaccgaac 
gtagtcgttg 
ggccatgcgg 
aaccgccgtg 
ccagaactct 
gacgagcatc 
gaagccgccc 
cagcgcccac 
gagcatgaac 
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4980 
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5100 
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7080 

7140 

7200 

7260 

7320 
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7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 
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8040 

8100 

8160 

8220 
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8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 



atgccgacgg 
ccgggaatga 
gagatcacga 
tacgtcatca 
ggcgacgcca 
gcgtcgatcc 
aggagcgaga 
tacgcgatga 
ccgaaggcga 
tcggtcccga 
agggccggaa 
gcgaagaacc 
atcccgatgt 
tcgaggtgat 
accatgctca 
gcgaccgacg 
gcgcccacca 
gggaaccagc 
tacggccact 
gccagggccg 
cccacgagga 
cggtgccgtt 
tagttgctcg 
gtccagccga 
gcgttcttga 
gagctagcgg 
gccggctgtg 
tccttctccg 
ccacatctca 
ggcggtcgtg 
ctcgggcgcg 
cgcgccgagc 
ccattcgttc 
cgcgatctgg 
cgcccggcgg 
cacgagcggc 
cagcgtgccg 
aggcccagca 
acccacacgg 
cagctcccgc 
acgagcggca 
acgatcaggc 
ccgatgacgc 
aagaacagcc 
tccagccccc 
gccttcatgt 
cggctgccgt 
gagatcgcgt 
agcccgccga 
cacgggccga 
cgaagcggga 
atgatcgagc 
gccgagtgcc 
aggagtgtga 
tagaacgcga 
atgagataca 
gtcagcgatt 
gcccggatgg 
ctggcaagcg 
caaatcaggc 
tcccaatggg 
gatgggagtg 
caccggggaa 
ttttccggcc 
cattctggga 
tggggatggc 
gagctccaga 
cagtcaccgc 
cgcgcgcagg 



cgacgccgtt 
acaggagctg 
ggaagacgag 
cggtggactt 
gcgtcgacga 
gcagcgcctg 
tgttcatgtg 
acaggtagaa 
ggagggcgag 
tggtccagag 
tcgaggtgaa 
cccgctgctt 
gcaggtagta 
agcggacgat 
cgagcagccg 
cgttgccgat 
tccagtacga 
cgaggaggag 
ccttcgtgcg 
acagcgcgaa 
tccactcggt- 
tctccggatg 
aggcgatgag 
agagctgcgg 
accagtgatc 
ctgacgacat 
acgccgtggc 
tccacgtccg 
cacgccacat 
cagagcgcca 
tagtccgaca 
aaccccgggc 
accgtcgccg 
cgcgccgcga 
tactcccgca 
gtggcggcga 
tccacgtcga 
tccgcagaat 
cagccagccc 
gccaggcgcg 
ggtgataccg 
tcaggagcag 
cgagcaccat 
accgcggcgg 
agaagtctgc 
agtcgagcgc 
ccggcgcgcg 
cccacatgcg 
gcgtgtccac 
cgatcgcggc 
tgcccagacc 
gaacgagcgc 
gccctgatgg 
tgaagagcgt 
acgccgccgc 
cgagccagac 
ccgagcccgt 
acgtcggttc 
tcgcgtagtg 
cgagccggag 
tgcgctcggc 
ggggagcctg 
tcagcaaaga 
ggaagtcggc 
tgacttttgt 
ataagcttgg 
aggcctatcg 
tgggggcgct 
gcatccggcg 



gttgtcaccg 
accccagccc 
ccgcaaccgc 
catgaagtcg 
caggctgtgg 
cccggacagg 
gatgagaaag 
cagaatcgcg 
gacgaaagcg 
cggctgcgtc 
catgagcacc 
gtagagcttc 
ggcgaacatg 
gaagatgccg 
ctcctcgttg 
gtgccggaac 
gatcacgagc 
ccggacggcg 
aatgggcggc 
gtatctgttc 
gtagcccgcc 
cagcagatct 
gcaggtcgcg 
ttcgtagaag 
gatccgcgcg 
cgttgagctc 
ggccggcgat 
gtcgatcccc 
ggaagccgcg 
gcgaaaaatg 
gcacgccgag 
gccggcacca 
ccacgacctc 
gcggctcgtc 
cgatgcgcgc 
gctccgctgc 
acagcacggc 
cgccgacacc 
cgccaacagc 
cgtgtcgatc 
gggatgtccg 
caggtgcgcg 
gaccgccgcg 
acggtagaac 
gaacttcagc 
ctggcgctgc 
aggcggcatc 
atcctcgggc 
gaccgtgaac 
gtatccggcg 
gacgagcgcc 
ggacgcgccg 
gcgatcgatg 
ctcggacagc 
ggcaatgagg 
accgagcagg 
gaccgcccag 
ccccggcccc 
ctgctcgtcg 
gaccagggcc 
ctcgtcgcgc 
.catcafctcgc 
ctctgccaac 
ccccaaggcg 
ccafcgatcca 
cacctgcgat 
ggacgaccgg 
cctccattcg 
cgcggcggag 



agcgagtaga 
tgcttggcgg 
ttgacgtccg 
atcatgtagc 
agcccgaaga 
aacgcgatcg 
tcgctccaca 
tagaacggcc 
gctcgaagca 
cgggtctcct 
acgaaaagaa 
tccggattct 
cccgccgcca 
ccgaagaggg 
tagaaggcaa 
tccgcgtacc 
gacatcggcg 
ttgttcgccg 
acgttgtacg 
acgagcagcg 
gccggcttga 
ctcgggccgt 
gcgagcgcga 
aacgccagca 
atctggacgt 
ctcgagccgc 
gagggcgcag 
gacatacagg 
cgggtgcggc 
ctgcgacgtg 
cctgagcccc 
ccgcagatac 
gttgtccagc 
ggccgcgccc 
cgtgtccact 
catggcggcg 
cttgaccccg 
tttgacgcgt 
accgccgctg 
gcggcggcgc 
aagacgatcg 
cgccagttga 
tacgatccca 
ccatttccga 
accgaacggc 
gcccacttct 
gtgcgcgacc 
gtgtacgcgt 
gagcgttgca 
aagagaagca 
gcggccatga 
agcgcgacgc 
agcgtcagac 
accagcacgc 
ccggcgcgtg 
cfccaacggaa 
acgcccgcga 
cacgcgaacc 
cgaatctcga 
agcgtgagga 
ccgggggcga 
.taaaacaagc 
gggtccgccg 
acccggctgg 
tgaccatttc 
ggcgacgtgt 
gtcgcccgtg 
cgtcagatcg 
atcgcacccg 



cgtcgttgaa 
cctcgaaccc 
tcgtgaggac 
cccacgcgaa 
aggcgaggag 
cgagcgtgta 
cccagagctc 
cgcgcagcgc 
ttgccgtgtg 
ctacattgaa 
tcgccgacac 
gcacggggct 
ccggcgcgaa 
cgcagcaggt 
acgacttccg 
gcttcgtcgc 
gcacgcgatc 
actcgctcag 
tgacgccgag 
cgagcacgaa 
tcttcccgga 
ccaggagctc 
gcgcgagcgg 
gcacgccgag 
acggccaaat 
cggatcgcga 
gagagtcctg 
acctcccgag 
ttcagggcgt 
ccgagcgcgg 
gccgcgcgaa 
ttgagcggcc 
ccgagcgcct 
aggcgccgga 
ccgcgccgcc 
cgcaacggac 
cgcaacggct 
cgcgaaggaa 
ccgcgcccgc 
cgtagcagag 
cgtgaatgcc 
ccggccgcgc 
tgatggacac 
cgccggcgat 
gaagcgtcgt 
ctttcgtgcc 
attccttgtc 
agttgcccat 
ggagcgtgtt 
cggccacgcg 
ggatcgcgac 
ccgtcgcgag 
atcccaagag 
cggagaacag 
cgctgaacaa 
cctgcgccag 
cgaacgcggg 
cgttcccggc 
gcgcgacgtc 
tgacgagaat 
cgattggcca 
agttacggtc 
gcggcgacgc 
gcacagccca 
gaggggcctt 
cgatgatcag 
aatacgtggc 
gggtcgtgcg 
gtccagcgcg 



gttcttgatg 
gagggagagc 
tgcgagcagg 
gtcctggtag 
cgccagccgc 
catgcccgcc 
cggccggaag 
gtggaaggcg 
ttacctgtta 
caccttgtag 
caccatgtag 
gttcggcagc 
caggatcagc 
ggcgtagaac 
gtacgaggcc 
catgaagaac 
ctgaatcagc 
cacgtcgagg 
cacccagagc 
tccggccacg 
gggcaccggc 
gttcagcacg 
cgggacggcg 
cagcatgaac 
ccgcgaaccg 
accatccgcg 
cggccatcgc 
gtccgagtga 
tcacgttgct 
cgagcttgct 
gcgagtcgag 
ggcgcaccat 
gcgcggtgcg 
gatcctcatg 
accggaacgt 
cctgccggta 
tcactctgcg 
cacgtcacga 
tgccgtccgc 
cagaatcggg 
gctcatgaac 
gaacgagatg 
cgccccggcg 
gaagtcgcgc 
tcccggatgg 
ctccgtccag 
gcccgtgagc 
catcaggttc 
gcgcacgctc 
cagacgcgcg 
gtacggccac 
cgcgatccac 
caacaaaaga 
cagcgaggga 
cctgcgcccg 
ccggaccgcc 
aaagagcggc 
gacgacgttt 
cttgaacccc 
tgcccggcgc 
gggggccgac 
aaacctgtcfc 
ccggcgcgcg 
gtagaaatct 
cgtgacgccc 
cggccgtgag 
gcggcggttc 
agagctcgtc 
gctgaccgtg 
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9120 

9180 

9240 

9300 

9360 
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9480 

9540 

9600 

9660 
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9780 

9840 

9900 

9960 
10020 
10080 
10140 
10200 
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10320 
10380 
10440 
10500 
10560 
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10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 
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12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 



gacatcgcgc 
gaagcgcgcg 
gacgcgtttc 
cgccatttcg 
cccggcggct 
aacgcgaagc 
gacgagctgg 
cccacgctca 
gcgatggagg 
cgcgggtgac 
tgaacgccct 
agcgcagccg 
cccttcgggc 
gcgggttctc 
tcaaggaaga 
gccgcgctcg 
atcccggcgt 
tgctcttcgc 
tcgtcgagct 
gcgacgtcga 
tctcgtacga 
tgtgcttcgc 
tgggacggcc 
gcgccggcat 
gcgccggctg 
tcggccgctt 
gcgagcacgc 
tcagaaggcg 
gcgggcgaat 
cgtgcggctg 
cgcgcagtac 
ccggcaccac 
gtacgactgg 
cgggctctcg 
caccggccgt 
ctgtgagatc 
ggtgaccgcg 
cccggagaac 
cgatgagggc 
agatcggttt 
acagtaccgc 
gctcgacgag 
ccccgtccat 
gacgttcagg 
gccgtgcttc 
gaccagatcg 
ggccgcaatc 
ttcgcgcgca 
cggtggccga 
tgtgcggcaa 
gcttcgagaa 
fccaccgcgtg 
ggagcgccgg 
gccgccatgg 
ggcggcaaga 
tcgaggatcc 
gcgctcagct 
gctcgcgggc 
cgaacaacag 
cggtcttccc 
cgcgggccga 
ggaaggcgtc 
cgcgatcctc 
cctcggtccg 
gcctggtcgc 
ggacgacgaa 
ccacgaactt 
gtccctgctg 
gcggcgggct 



cgctcctcga 
agcggctgcg 
gtctgccgct 
agcgcgccga 
ggctcgtctt 
ccggcgagta 
ccgaggccgg 
tgaagtgcca 
tgatcgatcg 
ctactggacg 
ccgcaccggg 
tctcgtgccg 
cgcggccgcc 
ctggcatctc 
tggcgctgcc 
cgacacgtgg 
ggacctcgac 
cagcacgccg 
ggcgcggctc 
cgcctccagg 
ggacgcaccc 
gggcggcacc 
gtgtctctcg 
catcgccgcg 
ggccggctgc 
ccgcgccgcg 
gaggcgcgcc 
aacgatttcg 
cgccgcaggc 
gccagcatcg 
ccgcaggcga 
cccgcgatca 
atcatcaacc 
tacccgcaga 
cacctcgaga 
gaggagcatt 
attcacgtgg 
cgccaggtgc 
tttcgcacgt 
gcgatcgcca 
gagatgctcg 
ctcgtggccc 
ctggcgatgg 
atgcacccga 
acctactcgt 
cattactgtc 
gaccgcgcgg 
gtttgaagag 
gtcggcgcgc 
cggcggctcg 
ggagcgccgc 
gccgaacgac 
gcgacgtgct 
acgaggcgga 
tgaagagtct 
aggaagtaca 
gagctggcgg 
gatgaggtcg 
ttgcagtcga 
gtcttccacg 
ctgggcttcg 
ggcggcctgg 
gagcgcgagc 
gtcgcfccccg 
ccggattgcc 
gcggtcggtc 
ctcccgatcg 
cgccgcgtcg 
cttcaggacc 



ccgcgtgacg 
cgaccggacg 
cagcgggccg 
tcggctgcga 
cgatgcggtg 
cgagcacfcac 
gttcacgatg 
gatgtatctc 
gctcggcggc 
ggtacctggg 
tcgcgggcga 
aaagatcgcg 
gtgctcgagc 
ctgagggcgc 
gatctgcccg 
atcgcggccg 
tggtatcggc 
tccgatccgg 
cgtccggaga 
cgcgcgctcg 
gacatgcgcg 
ggcaaggcct 
acgcgcgaga 
cgcgatgccg 
gccgagcgcg 
tacgacgcgc 
tcttgacgaa 
ccgactacgt 
tccatcgggc 
gcgacgtcgc 
cgatcgactt 
acaccgtcta 
tccagaacct 
ttctcgagca 
acggctcgga 
tcctcacggc 
acgcacaggc 
tggcgatctt * 
actccatcga 
tcatcgggca 
gccgccatcc 
tcatggactc 
cccgcggagt 
cgctcggcga 
ggcgatggaa 
acaacgaggc 
cggcgcggtt 
tcgatcgaga 
ctcctcatcg 
gccgccgact 
gcgttgcctg 
tacgcgtacg 
catcggcatc 
aaagaaacgc 
cgccggcgtc 
cattaccgtc 
tagacgtcga 
gcggcggcgt 
ttcgcgtagc 
acttcacgca 
agcaccgaga 
agcagcaccg 
tcggcagaga 
accagccagc 
gtagcctgcc 
tcgaggccgg 
accgcgttcg 
atcatcacgg 
caggcgatct 



ctcgtcgacg 
ctcgccgcgc 
ttcgatctcg 
ctctaccgcc 
aatgagattg 
gatgcgctgc 
gtgtcgctca 
gcgccgaggt 
gaaccgctcg 
atccgatcaa 
acgcgacggt 
tgctcgtgtt 
cgcgcggcga 
tggggcggcg 
gggcgatcgc 
gtatgtctcc 
cggtgccggt 
ctgagatcga 
tcgacgtgct 
acgccctccg 
cctgttacgc 
gtccgaattt 
acggcatcgc 
cgtcgctcgc 
cgcgccggct 
tgtacgtgga 
gggactcgcc 
ccgccatcga 
gctcggcggc 
gcgcgcgacg 
cctgacgtcg 
cacgctccag 
cgtgccgccg 
gctctcgagc 
gtccacgccg 
gctcgtcccg 
acgagcggcg 
cctcgggtcc 
ctacgtgaag 
gtcccaggtg 
ggacgtgatc 
gttttccctg 
gccggtcatc 
cgagcgcttc 
gttcttctgc 
gttcgtgttt 
gctcaagggg 
cgaagcgcaa 
aggcgctcga 
cgcagcattt 
ccgtggcgct 
agaccgtgtc 
acgacgtccg 
atgaaacgaa 
tgccaggtca 
atccacatct 
acgtgcgatc 
cggtgaggcg 
cctcggcgtc 
cgcccgggat 
gcggcagccc 
gcgcgtcctg 
ggcgcctcag 
actcgaccgg 
ctttgtgtgg 
cggccgtacg 
gaatcgcggt 
cgatgtagcg 
cctcgggatc 



cgagcgccca 
gggccaatct 
tctacacctt 
agattgcgac 
tgtctgcgcc 
tcacaccggc 
ccggtgcgca 
caacggtgct 
agtggatcgt 
agaagcgatt 
cgtgtcgttc 
gtccggccgc 
cgtgacgcac 
gccggttctg 
gcacgtggcc 
cggccgcgtg 
ctcgtcggag 
gccgcgcggc 
ggtgccgtgg 
tccgccaccg 
gcgggcgcac 
cgtgctcgag 
ggagatcctc 
cgcgagcctg 
cgcagaggag 
ggtggcatcc 
ggacggctgc 
tcgcagatga 
cgcgtgaacc 
ggcgtcctcg 
gacgcgaccg 
gatctgtcgc 
gcgagctttc 
cgcagccgfct 
acgaacatcc 
tacgacacgg 
gcacgatcga 
aactcggtcg 
cgcctcgtgc 
cgcaacccgg 
gatctcgtgg 
ctcatcagtt 
gggctgtacg 
gtggccatca 
tcgacgtgcc 
ggcgtggacc 
agctccgatg 
agcgctctcg 
ggccggacac 
cgccgccgag 
cacgaccgac 
agccggcagg 
gcaactcgcc 
tcgcgctcac 
tcgtcccgtc 
gggcgaagtt 
ttcgaacgtc 
gcggcggagg 
gtccgcctgg 
ggtggagccg 
ctcgtgcgtt 
gcggaagccg 
cgcctgctcg 
caggccgcgc 
cgcaagattc 
cgcggcttgc 
gaccggcacc 
cgcacacgfcg 
cggttcgatg 



gatgcttcac 
cgtgcaggcc 
caggctcgtg 
gattctcgga 
gcttcgcgcc 
cgggctacgc 
tcgccggttc 
cgcgcgcgcg 
cgtatgccgc 
tcgaaggaag 
tcgccggccc 
gcctggatcg 
gtcttcggcg 
ctcaccgccg 
gtcgaagtcg 
cacctcattc 
agattcacgc 
attccgctcg 
cgccggtggg 
aacttcatcg 
gccaccgtcg 
ggattggcca 
gatcgatcgg 
gacacgctcc . 
acgttcgacc . 
gccgccgctg 
tcgggtgggc 
agcgcgacgc 
ggatcctcat 
gggccctgcg ' 
tggccgtcat 
gcctgtccga 
tcagaacgtc 
ccaggatcat 
tctactgcat 
cccgctaccc 
agttcgcgtt 
ggtgcggcgc 
gtcacttcga 
ccgaacgcgc 
acaagacgtc 
gcgactcgag 
tcaacgacgg 
acagcacgcc 
gcgacccggc 
gcattccgct 
caggacctca 
ctcgtcgatg 
aagatcctcg 
ctcgtgggcc 
acgtcgatcc 
tgagtgcctc 
caacatcctc 
cggacgcggc 
agatcagacg 
gatcgagaaa 
gaccaggagt 
tgctgatcct 
acgagaaagc 
atcacgggca 
gacggcagca 
agaaagcgca 
tgttccttgc 
gacatcagca 
gcgagcatga 
cggtcgccgg 
ttgtcccggc 
acgatgtgct 
tgaaagtgca 
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13260 

13320 

13380 

13440 

13500 

13560 

13620 

13680 

13740 

13800 

13860 

13920 

13980 

14040 

14100 

14160 

14220 

14280 

14340 

14400 

14460 

14520 

14580 

14640 

14700 

14760 

14820 

14880 

14940 

15000 

15060 

15120 

15180 

15240 

15300 

15360 

15420 

15480 

15540 

15600 

15660 

15720 

15780 

15840 

15900 

15960 

16020 

16080 

16140 

16200 

16260 

16320 

16380 

16440 

16500 

16560 

16620 

16680 

16740 

16800 

16860 

16920 

16980 

17040 

17100 

17160 

17220 

17280 



cggccacgcg 
tgtgcacgtg 
cggcgagctg 
cgccctccgc 
ggacgccgcg 
ccagatgtga 
cggccgggcc 
gcagcaggct 
gccgcatcgg 
cgtactcgag 
gcgccgactg 
gcatcacgat 
ccgggatgaa 
cgccgtggat 
acgaagcggb 
gaagatcgag 
tgaagctcca 
cccacgagtc 
gattgcgggc 
atctcgccag 
cgggctgata 
tgaacaccgg 
gctctcgagc 
acgtttctcg 
gatgccgggc 
ggctgttcgt 
agcggcgccg 
tcggcgacct 
agatcgaatg 
gatcgctcgt 
gtgccgcagc 
acgagcacgg 
tacgtccatc 
caacgccccg 
cgacgcgaac 
cccacgatcg 
tctgggtcgg 
cgcggaaatg 
gccggtggat 
cgaacgtgac 
tccccggcgt 
acacgatctc 
acgcgtcgag 
catgctcgca 
cctggcgctt 
cgcfccaacac 
cgagccgcgc 
cgccggcaaa 
tgatcgcacc 
cgacttcgat 
ggatcttcgt 
cgagcgcctc 
cgtccagcgc 
cccgcaggct 
cgggggccgt 
cagccgaaga 
ggctggttca 
tgcagggccc 
aacatccaga 
gccatcgaca 
ggcagccaga 
~ "accacggagc 
tccctgggaa 
aagcggagcg 
tacgcgaccg 
ttgatgagcg 
agcaggtcgc 
atctcacggc 
ccgggcggat 



cgcgcgcgcg 
gaccaggggc 
ttgcaggccg 
gtcgagcgcg 
gcgtttggcg 
gacgaggatc 
gccagcctac 
gcgaagtccc 
caggcccgcc 
gtggccgagc 
atcgctcccc 
ggacgacttc 
accgaccggg 
gccggtcgcg 
cacggcgttt 
cgtgcgcagg 
gtcggcgtgc 
gtcggcgtcg 
gccggcaagg 
gatgtctgcc 
ggtctgggag 
gacgatcacc 
ggcttcagcg 
aaagccacgg 
cgaggtagct 
tgatgtggcc 
tgttcgtgag 
cgaggctcat 
agttcagcgc 
agccggccgc 
cgaagtcgtt 
gcgccgcctc 
gctcgaagcc 
accagtagag 
gcgcggcggc 
atgttccttg 
gttgtgaaac 
gaggtcgagc 
gttgccgtcg 
gtcgtccagc 
cacttcgctg 
gacggggctc 
gacttcgatg 
gagggtcgcg 
gaactcctgc 
ctcgtcgatg 
gttcatgccc 
ctcgacgatc 
ctggaggaac 
gagcgcgccg 
cagcaccttg 
gccgcggtgg 
gtcggcggga 
gtcatgacgc 
cacgtcagac 
cgagcgtcac 
ggagcagcac 
actggagctt 
cggcgatgac 
ggatcagggc 
ggatgttcgg 
cgatcgcgaa 
agtacacctt 
ccgaggagaa 
gcgtgtcgat 
gcatgaacac 
gccgcgtcat 
aatcggccag 
catgagggca 



gcctgcagcg 
cgcccgacga 
ccggcccgca 
gcgcccgcag 
gcggccgaca 
tcgtgcaggc 
gttactcgcg 
gacgggagga 
tgaccgaacc 
gcgagcagga 
cgcgccagga 
gacatgttgg 
cccttcgccg 
aacatctcct 



cgcaggacga 
acctcgccgg 
tgtcggaaga 
agcagcgcca 
ccccggttgg 
gtgccgtcgg 
agcgcggact 
gacacgaggc 
gccatggccc 
agacaacggc 
gaacccgcgt 
cattccgccc 
cacggcgacg 
ggagaggtcc 
gacgccgagc 
cttcacgcct 
catcgatcgg 

ggcggcgacg 

cagccgttcg 
ccgcttgagc 
gcatgcagcg 
accgccagcg 
acgtgcaccg 
gagaactcgc 
tagacgacga 
ggctcggtac 
tgcgtgcccg 
ccgacactcg 
gccgtgccgg 
accgcctgca 
atgcgatcga 
atcagcacgt 
gacgagtagc 
tcgtcgaacc 
acgttctcgc 
gcgcgccccc 
agggtggtcg 
acttcgaacg 
cgcggttgaa 
tcgccccgcc 
gttctccgcg 
gaccgagacg 
ggcgcggaag 
gccgcccacc 
gacctcgaag 
gatggccgtg 
tccgaccggc 
gtccacggeg 
cgagacgaga 
gaagttccag 
ggccgcgacg 
ggcccacccg 
ctgcatcagc 
gaaggcgcgc 
ggtagcgctc 



cgggtctcat 
gacccaatcc 
gcgcgttcag 
gaccctcgcc 
accggatcgc 

ggggggactg 

tcgcaggtcc 
agcaoacggc 
gtttccgcgc 
gacgatggcg 
accgctccag 
tgtcgtgcag 
agaggcgcag 
gccggagcac 
ggtcggggag 
cctcgtcgat 
gatcgacctg 
cgaattcccc 
cctgagagat 
tcgatccgtc 
gcacggccct 
cggggatatc 
tctgcactgg 
gcgctcttca 
gccgcgaaga 
tggcccggct 
agcttctcgc 
gttgcgtccg 
ttttccgcgg 
cgtgcgagga 
atgccgggtc 
tgcccgaaca 
ccgatccggg 
tgggtcacga 
acgccgagaa 
tcgcaagctg 
acaggtagta 
ccatgaacgc 
gctgatcggt 
ttctcgcggt 
acgactcgtc 
gctgttccga 
cggccttcac 
ggtfcgtgcga 
agcacttctg 
cgggccgcag 
gcttgacggg 
gctcggcgat 
gcccggtgag 
ggacgatcga 
acttgcccgc 
agacgtttct 
acagcccgcg 
ggaacttctt 
aattcgaact 
atcgccgtga 
ccgtcgatga 
atttcgaccg 
aggtatttca 
gtgaatatca 
acgtggtagt 
ctcacgatca 
ttcgtgttcc 
acccacaggc 
cgcatgaaga 
aagcccatca 
agctcgcgat 
atcgtcagcg 
gagctgcccg 



gaatccgtac 
cggaagcatg 
gtcatacatc 
tgtaatccat 
cagcaccgcc 
attcatggtg 
gcccgcgcgg 
gtaccgcgtg 
cgcgtgaaac 
atgatcgctg 
cgtcctgacc 
cctgtaatgc 
ccagaggtcc 
ggcggctttg 
acagcgcccc 
gacggcgcgg 
ccggcgaatc 
tcgcgccgcg 
caccgtgatg 
gtcgacgacg 
ggcgatgtag 
ggtctgggcg 
cgtcgcgagc 
tccgctcccg 
tcttctgcca 
gcgcggcgct 
caagcggcgg 
cggccgtgga 
cgtaccgtcg 
acgcgtgcac 
gcaacgccag 
gatccatgac 
acatcttcag 
tcatgggttc 
ctccacatcg 
ccgcgacggc 
gtggccgcga 
ggacgcgtcc 
ggagcgttgc 
gacctggagg 
gccgacgaag 
tttcctggac 
cgagccggcc 
gacgaggacg 
ctggaacgtc 
gtgcgcggcg 
cgtgtccacg 
ctcgacccgt 
gtcgggatgg 
gcggceggtc 
cccgtttgcg 
gaccgcccag 
cacgagcgcc 
ggagacgttg 
cggagcgatg 
cgcccagcat 
tcggcgtcat 
ggtagaccgt 
cgtcgcggta 
cgagcacggc 
agatcatgag 
cggcggagaa 
cggtcaggga 
cgcagaacgc 
tcacggagaa 
ccgactgctt 
actcgaactg 
gtgtcagtcg 
aagagcaact 



acgacaggat 
cgcgcacacg 
cgccgccgga 
gccgtgcacg 
gcgccgccga 
gcgcgcgatg 
tagagcgagc 
agatcggtgc 
tcgccgcggt 
gcggcgcgat 
atgccttcga 
gtcaggggtt 
cagtcctcga 
accatgaccg 
tcgagcggcg 
cccgtatgaa 
ttggccggat 
gcgatggcgc 
gcgccgccga 
atcacttcga 
cgctcgccat 
gccggcgagg 
gaacagcatg 
cagcgtctgc 
gtactcgagc 
gaagagcacc 
cggaaggtgt 
caccgccgtg 
cgcgtactcg 
gaagtgaccg 
gaccccatca 
ggccgcgttg 
tccggcctgg 
gtcccgggcg 
gccacgccgg 
acgaggaacg 
acgagatgcg 
ccgagatccc 
accatgaagc 
ttcagctgct 
agcgagccgg 
gcgcggacgt 
agatacaccg 
atcgcgatgc 
atgtcgccga 
atcgagaagc 
aagtcggtga 
ttcatcccca 



aaccccgcgg 
gtcggcttca 
ccgatgatgc 
aactcctgtt 
ggcacgaggt 
tcgaagacga 
gaagaacagc 
cacggacggc 
cgggttgagc 
cgacgacgcg 
gaagaggttc 
gagcacggcc 
cgcgacgagc 
cgggaagatc 
ggtcacggag 
gaacaccgga 
gacagccgtg 
gtagcgcagc 
ctcggcgagc 
cggcgtcgtg 
gcacgaactc 
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17400 

17460 

17520 

17580 

17640 

17700 

17760 

17820 

17880 

17940 

18000 

18060 

18120 

18180 

18240 

18300 

18360 

18420 

18480 

18540 

18600 

18660 

18720 

18780 

18840 

18900 

18960 

19020 

19080 

19140 

19200 

19260 

19320 

19380 

19440 

19500 

19560 

19620 

19680 

19740 

19800 

19860 

19920 

19980 

20040 

20100 

20160 

20220 

20280 

20340 

20400 

20460 

20520 

20580 

20640 

20700 

20760 

20820 

20880 

20940 
- 21000 - 

21060 

21120 

21180 

21240 

21300 

21360 

21420 



ggtcgcgcgc 
gcggacgagc 
gctccagtac 
ccgctcgatc 
caccttgacc 
catcggcgag 
cacgtagggc 
cgccatgacg 
cgggacgtfcc 
aaacgccgcc 
ggactcacac 
cccagaccac 
tgtggaagaa 
ccacacgcgc 
gcatgccgac 
tgagcgaccg 
tcacgttgcc 
gagacgccag 
tcatcagcac 
ggaacacgat 
cgccgagcaa 
cgctcggcgt 
cgagcggtac 
accgcacgat 
ccagtctggc 
ggcagaactt 
gacgggattg 
atcaactcct 
aaatgtcatc 
ccgacagaat 
gaaaactgca 
ggcattgcag 
tggagcaggg 
cgggctcggc 
gcgccggccg 
aacgcgctcg 
tccctcgacc 
gcctacgcgc 
gccggggcgc 
gtcgagctca 
ccggacgccg 
gggcgcccga 
gtcgacgcat 
gacgtctcgt 
tccgaggaca 
gcccggctgt 
ccggatctgc 
gcgcfccgbcc 
gccgaagcgc 
ctgccgccga 
cagcccgtcg 
ggctacttga 
gatgctcgcg 
ttcgtcgggc 
atcgagaccg 
gacgcgtcgg 
accggagccg 
cgcgcggcgg 
acggggatgc 
atccggtcgc 
ttccgtctcg 
gaccacgtcc 
gaggcggcgg 
gtcgcgcagc 
accgtgcgtc 
atgtttcacg 
cgcgagcgcg 
ttcgacgccg 
ggtgccttcg 



cgctcggcgg 
gcggcgtccg 
tcgctggcga 
cccttctgga 
agcgcgcggc 
tggatcgtgt 
tcgcgcggcg 
tccttggtga 
gcgagcggct 
gtgccgagga 
cgtggccggc 
gacgaacaac 
cccctgcgcc 
gccgttcgcg 
gcgcggatcc 
gatcccgctg 
ttcgcgcagc 
gagctgcagc 
caggaacgcg 
cgcgccggta 
ccccgcgacg 
gtccggcagc 
gatcagaatc 
cacctgccca 
ctgaagcaca 
gggcgccgaa 
gccggctgga 
ccttttgact 
tgtcaggaag 
cgcgtaccac 
ttacaggaac 
gcgctttcgt 
cagggacagg 
gggagctcgt 
taagttttga 
cgcaccgcct 
ggtccatcga 
cgctcgatcc 
ggatcgtgct 
tctgcctcga 
gcgtcgatct 
agggcgtcgc 
cggccgtcgg 
tccaggaaat 
cgcgccggga 
atctcccgtt 
cggcgttgcg 
agtggttcgc 
atgtcgtcac 
ttggcgtgcc 
cgcacggcaa 
atcagccgca 
tgtatcgcac 
ggacggacga 
tcctccttca 
gcagcaagag 
aggcttcggc 
cggactcgac 
cctacaccgc 
atgcgccacg 
cgccccactg 
gggcccacgc 
actttaccgg 
acfctgcctgg 
cgggcggatt 
cgtcggtcga 
tgcagcgtca 
tcatggaccg 
acacggagat 



cgacgtcgcc 
atcggttcag 
tcacgcggcc 
tgacgtatcg 
cggtggatac 
cgccttcgcg 
ccaggtagat 
agggcgcgct 
ggcgaagcgg 
aacacacgct 
gcggfccaccg 
atgacgaacg 
gcggcctcgc 
ataatcgcga 
gtgaagtagc 
cacgcctcgg 
accgggatgc 
ggaaatgcga 
agcggaaacg 
agcgaggcta 
agcagcgcga 
gccgagagac 
ccgtgggagt 
taaaggagca 
ctcatagcga 
acacgtaacc 
ggggcctcgt 
gcgtagcacg 
agtaatgtct 
cgcgactggc 
gccaggtcgg 
ctgccggggg 
catggatggt 
ccacctggcc 
cggcgtctcg 
gcagcgtcat 
gctcatcgtc 
tgcgtatccg 
cacctcgcga 
tgcggacgag 
cgaccacctc 
gatgccgcat 
cgccggcgcg 
ctttgcgacg 
tccgcgcgcg 
cgtcgcgttg 
tgaggtcatc 
gcggcatccg 
tgcgcacacg 
gctgcccggc 
cgtgggcgag 
gcgcacggcc 
cggcgatcfcc 
ccaggtgaag 
gcatcccggt 
gctggtcgcg 
acaggtctcc 
ggatccggcg 
ggcggagatg 
cgaggtgctc 
cgctgtctat 
gacggcctgc 
cgttccggcg 
cgtggactat 
catcttcgtc 
gctcacgaaa 
gatggcgctc 
catccccggc 
gaaccggttc 



gttttcgctc 
ccgcatcgcg 
gtggctctgg 
gttgatctcg 
gggctcccag 
ctggctctgg 
ccggttgatg 
gtcgcgtccg 
gacctgctcg 
gagaatgatc 
ctcggggcgg 
cggcaagaaa 
cgtagtagtg 
tcgggatcgt 
cgtacacgat 
ccacttcgag 
tgagcgcgga 
tctgattgaa 
cgagcaccca 
gtatcgacac 
gcccggcgag 
gcgcccggcg 
agttgtcgtc 
ggaaactcac 
tcgggacgaa 
atcacgctca 
gaatctaccc 
gaagatcatc 
ctggccacag 
atccggcgag 
cagccggttc 
gcggaaccgt 
tccgtgagac 
gtggaccgcg 
ctctcgtatc 
ggcgtcgatc 
gggatcctcg 
aaggaccgcc 
cgcgtgagcg 
cccggggcgt 
gcgtacgtca 
cgtccgttgg 
cgcacgctcc 
ctcgcgaccg 
ctgctgcggg 
cagcagcttg 
acggccggcg 
cagtgcacgc 
ctgactagcc 
gtcgagattc 
atcctcattg 
gaacggtttc 
ggccgtgtgc 
attcgaggct 
gttcgggaga 
tatgtgattc 
ggatggcaga 
ettgccgcga 
cgggagtggg 
gagatcggct 
agcgcgacgg 
ggcgccagcc 
gccgcgttcg 
ctgacgcgcg 
ggcgacgtgc 
gcgccggcgg 
gagcgcgagc 
atctcgggag 
aggtacgacg 



gacggcacgg 
tcgtacacca 
taccagtaca 
atgggctcgg 
ccggcgcccg 
tagtagccga 
tactcgtcca 
gtccaccgct 
gcgcgcatcg 
agtcgtctgg 
aggcgtgaag 
gacgagccag 
cgccgccacg 
cgagagcgca 
gcccagcgtg 
cgtcgtgtgc 
cagcacgccc 
gatgatcgac 
gagcttgcgc 
gcgcgtcaga 
cacgatgacg 
ctcccaggca 
gttgatccag 
ggcggcgata 
tatagcaaag 
caggaagtta 
aatggatgac 
ggcctcttca 
cgccggtcaa 
aaaagtcgcc 
gttttccgtc 
gcgaccgggc 
agccgcgttc 
cggcgcgcca 
gcggcctgac 
ctgacgtggt 
ccgtgctcaa 
tcgagtacat 
gtctgatccg 
tggcggaaga 
tctacacctc 
cgaatctcat 
agttcacttc 
gcggcgagct 
tgctgcgcga 
ccgtcaccgg 
agcagctccg 
tccacaacca 
cccccgcgga 
tgatcgtcga 
gcggcgtgtg 
tctcgcatcc 
gcgacgatgg 
tccgtgtgga 
ccgccgtcgt 
cgtcagccga 
ccgtgtggga 
ccggctggcg 
ccgatgagac 
gcggcacggg 
atgtctctcg 
acgtcgagct 
atgccgttgt 
tcctcgccgg 
cgaacctgcg 
atcttccgat 
tcgtgatcga 
cccaggtcca 
tcgtgctcac 



ggctcgtcac 
tgtagatctt 
gcacgatttg 
tgcggcccgg 
gcaggcagtt 
tgtagagcga 
cgccgagcac 
ccatctggaa 
agcgcgcggc 
tcatgaatct 
cgccgcagga 
cccgagaacg 
cccgtgccgg 
atcgccaggc 
agcagcgaga 
gcgagcacga 
tcgcccacgc 
gggatcggga 
cagccgagca 
aagagctcgg 
agaccgagcg 
aagtaggccg 
tcctgcgcca 
ccgagcggaa 
ggctctcccg 
gcggctaaat 
aaatatccct 
gatgcaggac 
gtgaaatccc 
tcgtcggccc 
agtatttttg 
cgatcacgcg 
gggagacggc 
atttcctgac 
tcgccgcgcc 
ggtcggcatc 
ggccggcggc 
ggtcagggac 
cgtcgatggg 
gggccatgcg 
gggatcgacc 
cggctggcag 
gccgagcttc 
cgtgctcgtg 
tcgatccgtc 
caccgacgat 
ctcgacgccc 
ctacggcccg 
atggccggcc 
cgagcatcgc 
cctcgctcgc 
gcttcgcccc 
cgcgatcgag 
gccgggcgaa 
cgcgcacgac 
tcaggcggcc 
ggacacctac 
gagcagctac 
cgtcgcgcgc 
catggtgctc 
cgccgcgctc 
cctctcgcgc 
gatgaactcg 
cgccgcccgc 
cctgctcgag 
tgccctcgtg 
tccgatgttc 
gatccggcag 
ggtgggcgga 
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2X540 

21600 

21660 

21720 

21780 

21840 

21900 

21960 

22020 

22080 

22140 

22200 

22260 

22320 

22380 

22440 

22500 

22560 

22620 

22680 

22740 

22800 

22860 

22920 

22980 

23040 

23100 

23160 

23220 

23280 

23340 

23400 

23460 

23520 

23580 

23640 

23700 

23760 

23820 

23880 . 

23940 

24000 

24060 

24120 

24180 

24240 

24300 

24360 

24420 

24480 

24540 

24600 

24660 

24720 

24780 

24840 
24900 
24960 
25020 
25080 
25140 
25200 
25260 
25320 
25380 
25440 
25500 
25560 



cggccggccg 
cgtcagctcg 
accaacgcgc 
accggcacgg 
ccggaatcgt 
gccgagatcg 
gccgctcccg 
gcgacgaatc 
ctgcaggagc 
ccgctcacgc 
cccgatctgg 
gtctggcaga 
ggcgggcact 
aagcagtggt 
ctcagtgagc 
cgccagcgcg 
taatgcagaa 
tcgaggagta 
aggagctggc 
cgcgcgggct 
gcgaagccga 
tcgaacacgc 
tgagcacggg 
tgctccagcc 
gccgcgtctc 
cgacgtcgct 
cggcgctcgc 
agggcggcat 
cggtcttcag 
acgggcagac 
aggtcagtfct 
gcgccgccga 
cgctcggaga 
cggagaacgg 
cggccggcgt 
cgacgctgca 
tgaacgccgc 
cgttcgggat 
agcgaccggg 
cactcgacgc 
tcgccgacgt 
tcgcctgcac 
tcgccaccgg 
gcacgcagta 
aaaccgagcg 
cggatcatgc 
ccgcgctctt 
ccgacgggct 
tgtcgctcga 
cggccgggga 
atggactcgc 
acgcgctcgc 
acatcgcggt 
tcctccgcgg 
cgtggctgac 
ccgtcaggtt 
aagtcgggcc 
cgcacgtcgt 
tgcagcaggc 
acagaggcga 
acctggtcga 
- ccgatgctac 
atcgtatccg 
agatcgatcc 
cgagtctgcg 
aggcaccgac 
ttcctgcgcc 
gccctcatgc 
ttcaacagca 



ggcgccgtcc 
ggcaactgct 
ggctgcgtca 
ccggcgagac 
ggcggcgtct 
gcgccatcgg 
tgactcgtca 
cgctccgcga 
gcgtgccgga 
caagcgggaa 
acgtggcgtt 
cgacgctcca 
cgctcctgct 
cgatgatcga 
ggccggacgc 
agatgctgac 
cggcatcgcc 
ctggcgcaac 
gacggccggg 
cgtcccggat 
gctgatggat 
ggcgatcgcg 
gatgacgaac 
ggaggacgtg 
gtacaagctg 
cgtcgcggtg 
cggcggcgtg 
cgggtcgcca 
caacggcgtc 
gatctacgcg 
cgcagccccg 
cgtggcgccg 
tccgatcgag 
gttctgcgga 
cgcggggctg 
ctacgtcgag 
gctgcgacca 
cggcgggacc 
cgccgagccc 
ggcggctgcg 
ggcgcacacg 
cagccgcacc 
cgacgccgaa 
cgcgaacatg 
ctgcctcacg 
ggacgtcgag 
tgtcgtgcag 
catcggtcac 
gaacgcgctc 
gatgctcggc 
gatcgccgcg 
ggcctttgcg 
cgcggcgcac 
catcgcgctc 
cgcggacgaa 
cttcgacggg 
cggacagacg 
tcaagcgacg 
ggtgggcagg 
gggcaggcgt 
atcgccccag 
tcagcagatg 
cgagcggctg 
cgcgctgtcg 
gttcaaggcg 
gatcgacgcg 
ggcggaagcc 
ggccctggct 
attgcagttg 



gtccgtcgcg 
cagcgccgat 
gccggccgcg 
gcttgagagg 
cgcgaacgcc 
cttcgatgcg 
cgcggfctccc 
tgcggtggcg 
tcacctggtg 
gatcgatcgg 
tgccaagccc 
gatctcgtcg 
cgcgcaggcg 
gatgttccag 
cggcgccggg 
gcgtcagggg 
ctcgtcggga 
ctcgtcgccg 
afccagccaga 
ccggacctct 
ccgcagcacc 
ccgcggtcgt 
agcacgtacc 
ctgccggcgc 
cacctgcgcg 
gtgcaggcgt 
tccgtgtcgt 
gacgggcact 
ggcatcgtcg 
gtgatccgcg 
agcgtggacg 
cagacgatcg 
gtggctgcgc 
ctcggatcgg 
atcaaaacgg 
cccaacccga 
tggccgcgcc 
aacgcgcacg 
gccagcgacc 
aacctcgcga 
ctgcgcgccg 
gatgcgatcg 
tccggcagcc 
ggccgggagt 
cacctgtcac 
cgggccgcgc 
tacgcgctcg 
agcatgggcg 
gcgctcgtgt 
gtcccgcttc 
gtgaatgcgc 
cgcgtgatgg 
tcgccgatgg 
gacgcgccaa 
gcgaggtcgc 
ctcggcacgc 
ctgtccggat 
ctccggcacg 
ctctggacgg 
cgtgtgccgc 
gcgccggtcc 
acgacaactg 
accgaaatcc 
tttctggaga 
gagttcaagg 
ctcacgagct 
gctccggcgc 
ggcgtgccgg 
atggccgagc 



ctcgactggc 
gcgccggacg 
tgcttcgatc 
gccgccgcgg 
gccggctacg 
cggttcgagc 
gccgcggagc 
cggcgcctca 
ccgtcggcct 
cgcgcccttc 
gccacggagc 
gtgggattgc 
ttcgagcgta 
tacccgacgc 
cagcttgccg 
cctgccctga 
tggccggccg 
gcacggagtc 
gcgatctgca 
tcgacgccgc 
ggctcttcct 
tccccggtct 
tgctgtcgaa 
tgctcggcaa 
gaccgagcat 
gtcagtcgct 
tcccgcaaca 
gccgcgcgtt 
tcctgcgaag 
gatcggcgct 
gccaggcgga 
actacgtcga 
tcacgaaggc 
teaagagcaa 
cgctcgcgct 
agtgtgattt 
gtggccatcc 
tcgtgctcga 
tgctcgtgat 
gtcatctcga 
gccgacagga 
ccgtgttgcg 
gtgccgtggc 
tgctgccggt 
tctccgccga 
attggctcga 
cccgtcagtg 
agtacacggc 
gtctgagagg 
ccgaggcaga 
caagctcgtg 
ccgaggccgg 
tcgatccgat 
gggttccagt 
cgcagtactg 
tgctcgccga 
tcgcacggca 
ccaaggaacc 
caggggtgcc 
ttccggccta 
cagcgccgct 
-tgccttcgtc 
tgcacaaact 
tgggattcga 
tccggatcac 
acatcgacgc 
ctttcgtggc 
cggccgcgcc 
aactccgcat 



gtacggagcc 
cgatcgagat 
tcgtcaagcg 
acacgcgcga 
acgccgagtt 
gccggggcac 
cgcgggccat 
cgcccgagct 
tcgtcgtact 
cggcgcccga 
tcgaacgcaa 
acgacaactt 
ttcggccgct 
tctattcgct 
gcgcgcagga 
gcagagtcga 
ctttccgggc 
gatcacggtc 
gaatcccgac 
gttcttcggc 
cgaggcgtgc 
catcggcgtg 
tctccacagc 
cgagaacgac 
gaacgtgcag 
gctcacgtgg 
ggaaggctac 
cgatgcgaac 
gctcgaggac 
gaacaacgac 
ggtcgtggcc 
ggcgcacggc 
ctttcgcgcc 
tttcgggcac 
ccatcaccag 
cgccaacagt 
gcgacgcgcg 
ggaagcgccg 
ctcggcgaag 
ggcgcacccg 
gtttccgcat 
cggcgcggac 
cttcatgttc 
cgcgcaggtc 
cgtccgccgt 
agctccgtcc 
gatggcgtgg 
cgcgtgcctt 
ccggctgttc 
ggtgcggccc 
cgtcgtttcc 
tgtcgaatgc 
cctggcggag 
cgtgtcgaac 
gacgcggcat 
gccgaatcgg 
gcatccacag 
ggcgccggac 
cgtgaactgg 
tccgttcgag 
tcaagccctg 
tccggcccga 
gagcggcctc 
ctcgttgttc 
gttccgccag 
caagctgcct 
cgcaagcgtg 
cggcacgctg 
gctgggcgga 



ggtcgccgag 
caggaacgtg 
tgccgactgg 
cgcgatcgac 
cacctggtcg 
cgcgtcgctc 
tcagcagtac 
tcggcgattc 
cgatgcgatg 
gagccgccgt 
gatcgcgcag 
cttcgatctg 
cgtgccgggc 
cgccgggttt 
tcgcggacgc 
aggggtccgg 
gcgcggtcga 
ttcaccaacg 
tacgtccgcg 
atcagccagc 
tggctcgcgc 
tggggcggca 
cacgcgggat 
tacctcacga 
accgcgtgct 
cagtgtgatg 
gtctacgtcg 
gcccaaggca 
gcgatcgccg 
ggctcggcga 
atggcgcaga 
accggcacgc 
ggcggcgcca 
ctcgactcgg 
cagattccgg 
ccgttcttcg 
gccgtgagct 
cccgccgcgg 
acgccggcgg 
gacgtcagca 
cgacgagcgc 
gccaaacgtg 
tccggcggcg 
ttccgcgaag 
gcgctgtttc 
gttggtttgc 
ggtgtgcagc 
gcgggcgtga 
gagacgcttc 
gcgctcggcg 
ggcgcgcccg 
cgccgcctcc 
ttcgagcgtt 
ctcagcggct 
ctgagacaaa 
gtgctcctcg 
aaaacggcgg 
gtcttgttcc 
gctgcttaca 
aagaagcggt 
gcgccaaggg 
..tcgcgtgccg. 
ccggcaggcg 
ctgacgcagg 
ctcttcgaag 
gcggacgcgt 
ctgtccgcgg 
cagcacgtga 
aatcccgccc 
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25680 

25740 

25800 

25860 

25920 

25980 

26040 

26100 

26160 

26220 

26280 

26340 

26400 

26460 

26520 

26580 

26640 

26700 

26760 

26820 

26880 

26940 

27000 

27060 

27120 

27180 

27240 

27300 

27360 

27420 

27480 

27540 

27600 

27660 

27720 

27780 

27840 

27900 

27960 

28020 

28080 

28140 

28200 

28260 

28320 

28380 

28440 

28500 

28560 

28620 

28680 

28740 

28800 

28860 

28920 

28980 

29040 

29100 

29160 

29220 

.29.2.80 „_ 

. 29340 
29400 
29460 
29520 
29580 
29640 
29700 



tgctgcccgc 
cccgcgctac 
aagccagtgc 
acacacggcg 
atccacgctc 
ccgaccggtc 
tgggcgggtt 
aggaacaact 
cggcgctcgt 
aagccgtgct 
tcttcgaagg 
acggcgaatt 
tcatcctgct 
agatcgccgc 
agttcctgca 
aagtcatcac 
cggatcttgc 
gcaatgcgaa 
tcccggaggt 
ccgctgcgaa 
tggcagatcg 
cgccgtatca 
agacggcggt 
gtcccttctt 
tcgacgagtc 
cattgcaggt 
agatctggct 
cgctcgacct 
tcgcgcgtca 
ttccgggcac 
agcacgaggc 
gcgggccgtt 
tggcggccca 
cgacgttcta 
tgagccgctt 
aggcgttctg 
ggccgcggcc 
cgctcgctcg 
tgctcggcgt 
gcatcccggt 
tcgtgccgct 
tgaggacgca 
ccaagctgcg 
tcgagcccgc 
cgcgcgtgac 
tggtcgaggc 
atagctatcg 
tgccggtgat 
ccgactaccc 
cggagggact 
gcgcaaaccg 
cgctgtgcct 
gcgccgcata 
aggattccgg 
ccaggacgcc 
atgatctgaa 
cgacgggaca 
cggtgatcga 
cggccgatag 
tcacctcaga 
tcgacttcct 
cctcgctgct 
tggaagagct 
agaccacggt 
gcgtgccggc 
cgcccgcgcc 
ggtatctcaa 
agccgggagc 
tcgagttcct 



gatgccgccg 
gacggcgccc 
ggagctgtcg 
cacggccgga 
ggtcgccgga 
ggaaggatcg 
cggctcgctt 
ccaccggggg 
caaggaattc 
ggcggccacg 
gtcgtatcac 
gcggtcggcc 
cgagtacgcg 
gatcctcgtc 
cgaggttcgc 
cggcttccgc 
cgcctacggc 
gttcatggac 

gggggtcacg 
ggccgtgctg 
gatcgagcgg 
gattacgcag 
gctgctgttc 
cttcacgacg 
gctggccgag 
gaccgggacg 
cggcggccag 
gcgcgggcct 
cgaatcgctg 
gtcggtgccg 
cgtgcagaag 
gctgcgcgtg 
ccatctcgtg 
ttcggcccag 
cgcgcgtgag 
gatccggcaa 
gccagtgaag 
cggcgccaag 
cttcgagacg 
cgcggtgcag 
tcgcgcgcgc 
gtcactcgac 
gttgccgaag 
catcgtcgat 
atcgaagcgc 
cgactacagc 
gatcctgctc 
ctcggagccg 
gcgcgatcgc 
cgcgctcgtc 
gttggcgcat 
ggaacgatcg 
tgtaccactc 
tgccgtgctg 
cgtgctccac 
ggtcgccctg 
accgaagggc 
acggatcggg 
cggtgtgacc 
ggtcgcgacc 
gaagatcacg 
gccgcgccgc 
gcagcgtctc 
cggcgtgttg 
cggcaggccg 
gattggcgtc 
tcgtcccgac 
gcgcatgtac 
cggtcgcgcg 



ccagcgtcgt 
gtctcgaagc 
cccgagcaga 
tcgaagcgac 
ttcaaccggc. 
aaggtctggg 
ctgttcggac 
ttcgagctcg 
acggggatgg 
cgttttgccc 
ggtctccttg 
cccgcggcgc 
aacccgaagt 
gagcccgtgc 
cgcgtggcgg 
gtgcatcccg 
aaggtcgtcg 
gcgttcgacg 
tactttgccg 
acgtacatga 
ctcgcgaacg 
ttcagctcga 
ttcctgatgc 
gctcacaccg 
atgcaggcgg 
gcggaccagg 
atgggcgagg 
ctgaatcgcg 
cgcatgacgg 
gttgaatggc 
gtgctcgagg 
accaccatga 
tgtgacggct 
gttcgaggca 
gatcgcgagg 
ttcgacaccg 
tcgtaccagg 
aagctcgccg 
ctcgtctacc 
cctctgctcg 
gtgtcaccca 
ctcaatgagc 
cacccgggcc 
cccgggttcg 
gacctgcacg 
accgatctgt 
gaggccgtcg 
gaccggagtc 
gtcatccatc 
tgtggtgccg 
gcgctcgtga 
gccgggttca 
gaccccgtct 
gtcgtgaccg 
ctcgacgccg 
tcggccgaag 
gtcgtcatcg 
ttacccgccg 
gggctctcgg 
gacgggcggg 
ccgtcgctgc 
tgtctcatgt 
gcaccaggct 
acgtactgga 
ctcgggaacg 
gtcggcgagc 
ctcaccgccg 
cggaccggcg 
gactatcagg 



ccgtcgcggc 
cgcatgcggc 
tggccgcgct 
gcctggccga 
tctggaagga 
acgtcgacgg 
accgttcgcc 
gaccgttgcc 
agcgcgttgg 
gaacggtcac 
acgaagtgct 
ccggcatcgc 
cgctcgaggt 
agagccggcg 
acgagatcgg 
gcggcgccca 
gcggcggcat 
gcggccactg 
gcacgttcat 
agtcgcaggg 
acgcgcgcgc 
tgatgagcct 
gcgagcgcgg 
aggccgacta 
ctggcttctt 
acgtcctgcc 
cggcatcacg 
cggcgatgca 
tggccagcga 
aggacgtgtc 
gcggcgacgg 
cgctcacccc 
ggtcgttcgg 
cgcgtgcgaa 
ccaagcagag 
tccccgagcc 
gcgcccgtgt 
ccgaacaccg 
gactgaccgg 
gcgaggatct 
ccgcgacgtt 
accagaactt 
gcgatccgct 
acggtctcac 
tgaacgtcat 
tcgacgaacc 
tgtcgtcccc 
agcttgtcac 
aactcgttga 
agcggctgac 
agaagggcgt 
tcgtctcggt 
cgcccggcga 
attcgcggtc 
acagtccgcg 
acgccgcgta 
ggcaccgcca 
gctcgagtta 
cgctcgcgat 
cgctcgggtc 
tcgcgtcgct 
tgggcggcga 
gccggatcat 
ccggtgaccc 
tccgcgccta 
tctgtatcgg 
agcgcttcgt 
accgcgcccg 
taaaagtacg 



ggccgacgcg 
cttcaagccg 
cgccgcctgg 
gtaccggccg 
gatgatctac 
caacgagtac 
gatcgtggtc 
gccgctcgcc 
attctccaac 
cgggcgcgac 
cagccggccg 
cggcagcgcg 
catccgcgcg 
gctcgacctg 
cgccgcgctg 
ggcgcacttc 
gcccatcggc 
ggagtacggc 
gcggcatccg 
tcccggcctg 
gatcgtggcc 
gaacttcccg 
gattcatatc 
cgccgccatc 
cggcgcgccc 
gttcaccgaa 
cgcctacaac 
gcgcgccgtc 
accgctcggc 
gggattgccg 
cgtggcgttt 
cgagcatcac 
cgtcatcctg 
cctcgaggcg 
tcctgaagcg 
gctcgagtta 
gtccgtgccg 
cacgaccctg 
ccagcaggat 
ggtcgcgcac 
ctcggagtac 
cacgtacggc 
cgtctcgacg 
ggcgcgttcg 
ggagatggac 
gacggtgcgc 
cggccggtcc 
cggctggaac 
agagcgcgcc 
ctggaccgag 
cgcccccggg 
gcttgccgtg 
tcgcaagtcc 
cgaaggccga 
aatcgcgcaa 
cgtgatctac 
gctcgtgaat 
tgcgctcgtg 
gggctgggtg 
gtatttgacc 
gctcggcgat 
gccgtcacgt 
gaaccactac 
cagggatctg 
cgtgctcgat 
cggcgcgtgc 
ccccgatccg 
gttccgggcc 
cggattccgc 



ggactgaagt 
atcgacggca 
atcgatcgat 
gtgcttgccg 
ccgatcatca 
gtggatctcg 
gacgccgtgg 
ggcgaggtcg 
accggctccg 
aagatcgccg 
ctcgtggtga 
gtgagcgaag 
cacgagtcgg 
cagccgaagg 
atcttcgacg 
ggcgtccgcg 
atcgtcgcgg 
gacgggtcgt 
ctcgcgctcg 
cagacgcggc 
aggcacggcg 
cacgaccaga 
tgggagggcc 
ctccgcgcgc 
gcccactccg 
ggacagcagg 
gaagtcgtcg 
gacgaggtcg 
ctgcgtcacg 
gaggacgctc 
gacgtggaac 
gtcttcgtca 
cgcgatctgg 
ccgatgcagg 
gccgagaccg 
cccgccgatc 
tttgacgcgg 
tttacgacgc 
ttcgtcgtcg 
tgcgtcaact 
ctgaacacgc . 
agtttgctcg 
agcttcacgc 
ctgacggttc 
ggcggtctgc 
cggtggctgg 
ctcctgagcg 
gacacggccg 
gtgcagaccc 
ctcaaccgac 
agcagggtgg 
cacaaggccg 
ttcgtcgtcg 
tggctgacgc 
gagtcgcacg 
acgtccggat 
tacacgtggg 
tcgaacgtcg 
ctgcacgtga 
gttcacgaca 
cacccgtcgg 
ccggcgtggg 
ggcccgacgg 
ttcgtcgaga 
ccggccggag 
gtggcacgcg 
ttcgccaccg 
gacgggaaca 
gtcgagctcg 
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29820 

29880 

29940 

30000 

30060 

30120 . 

30180 

30240 

30300 

30360 

30420 

30480 

30540 

30600 

30660 

30720 

30780 

30840 

30900 

30960 

31020 

31080 

31140 

31200 

31260 

31320 

31380 

31440 

31500 

31560 

31620 

31680 

31740 

31800 

31860 

31920 

31980 

32040 

32100 

32160 

32220 

32280 

32340 

32400 

32460 

32520 

32580 

32640 

32700 

32760 

32820 

32880 

32940 

33000 

33060 

33120 

33180 

33240 

33300 

33360 

33420 

33480 

33540 

33600 

33660 

33720 

33780 

33840 



gcgaaatcga 

gaaaagatca 

cgatctcgat 

cggcggtgat 

gcgccctgcc 

ccgagacgga 

gcgcgcacga 

tgaaggtccg 

ccatcgccat 

tcacgccgag 

ccacccgcag 

cgtgtttctg 

gcctgaaagc 

cgccgcgctc 

gttctcgctg 

ctccctcgtg 

tgcggccgac 

actcgtcatc 

cgacgggtgg 

cgacgggcgc 

ccagaaggaa 

gctggcgggg 

gacctcccgg 

cgcattcagc 

cgtgctgcat 

cgatcgtccg 

caacgcgtcg 

tgcggcctac 

gcggacccgg 

gaagatgacc 

gtgcgacctc 

cagcgtcgac 

gcttcagacc 

cgacgagcgc 

cgtcagtctg 

gacgctggag 

ccggcggttg 

gtcgttcgat 

gctcgatttc 

catcgtgctc 

ggacgacgag 

cggagaccac 

cgcgatcacg 

cgacgtgcgg 

gtggggcgcg 

ggagacgcfcc 

gctcttcaac 

gatcggcggc 

cacgcggctc 

gattccgcgc 

tacgcaggcg 

gctctgcctc 

ggaacggttc 

agacctcgcg 

gatcaaactg 

gggagacgtt 

cgcctacgtc 

gaaggcgcgc 

gcgaacggcg 

cgcgcccgag 

ctggagcgaa 

7/™" 



agcggcgctt 

gcccggcgac 

ggcggagctt 

cgtcgcgctc 

ggcgccgccc 

gcgccggctt 

caacttcttc 

cgacacgttt 

gatggcggca 

ggcccgcgtc 

actgagtccc 

gcgccggcgt 

tccgcctaca 

gaacgcgcca 

cagggcgatg 

gacatgcgtc 

gacggcgcgc 

ctcgagccct 

tccggcggga 

gacccgaagc 

cagaccgaag 

gcgtcgatga 

ggcgccctcg 

cggggcgaga 

cgctggtccg 

gagcttgagg 

ggagatccca 

gcgcatcagg 

gatcggcatc 

ttcgccggga 

acgctcgagg 

ctgttcgatg 

gcgatcgcgg 

atgcagctga 

gccacggtct 

ggacagcacc 

aaggcgttcg 

ctgctggtgg 

gctcacccgc 

acgctgcggc 

ctgccggcca 

ctggcctatg 

catcgcggcg 

gcgatcctcc 

ttgctgaacg 

gagcgcacgc 

gccgfccatcg 

gaagcgctgt 

gtgaacgcgt 

acgctcgatc 

ctcattctcg 

ggcggagacg 

gtgcccaacg 

cggatgcgcc 

cgcggcttcc 

ctggacgcgg 

gtgcgcgccg 

ctgccggaat 

gccggcaagg 

ccgcgcgatc 

gtgcttcggc 



cgcgctcatg 

gagcgcctga 

cgcacgtcgc 

gacaagctgc 

gagcgcgcca 

gcggagatct 

gacctgggcg 

cgcgtggacg 

gcaatcgacg 

agggcccgca 

ccgacctgcg 

cgtacgccca 

acattccgat 

tcaacaccat 

agctcaagca 

tgctcagtcc 

gcgtcttcga 

ccgatttcgt 

tcttcatccg 

tgccggagct 

ggcccggcct 

cgctcgacct 

cctcgcttcg 

acgcgacggt 

gacaggaaga 

gcctgatcgg 

cgtttcgcga 

acctgccgtt 

cgatctatca 

tcgagctcga 

ttgccgaaga 

cggcgacggt 

acccggacca 

fccgcactggg 

tcgaagagcg 

tcacgtacgc 

gtgtgggccc 

cgctcctcgc 

aggaacggct 

gatttgccgg 

ccgccgagga 

tgatgtacac 

tggttcgtct 

acatcgcgcc 

gcgcgcggct 

tccggtcgga 

acgcgaagcc 

cagtcgcgca 

acggtccgac 

cgcaggcgcg 

atcgacacgg 

ggctggcgcg 

cgttcagccc 

gcgacggcaa 

ggatcgagcc 

tggtggagat 

atgggagtca 

actgtgtgcc 

tcgatcgctc 

cgtcgttcct 

tgccctccgt 



agggcgtcga 
tcggctatgt 
tcaagcagag 
cggtcacgtc 
gtttcgagtc 
ggagcgcggt 
ggcactcgct 
tgtcgctcat 
ccggtaccgc 
gggatgagct 
gcgcgacggg 

ggggcgcttg 

cgcgctcaga 
cgtccagcgt 
ggtcatctct 
tgagaaacgc 
tctggctgag 
cctcctgatc 
cgagctggcc 
gccgatccag 
gccggtgctg 
gccgacggat 
gctcgacgga 
gttcatgacg 
catcctcatc 
gttctfccatc 
gttcgfccggc 
cgatcggctc 
agtcatgttc 
gcggcttgtc 
cgccgacggc 
ggcgtcgctg 
gcggctctcc 
gaccggtccg 
ggtccgcgcg 
ggatctcaac 
gaacgtgctg 
cgtggcgaag 
tgcgttcatg 
ggcgctcgag 
gccggacgaa 
ctcgggctcg 
cgtacgcggc 
gacgtcgttc 
ggtgctggtt 
gcaggtggac 
cggcattctc 
cgtgcggcgc 
cgagtgcacg 
gtccatttcg 
cgacctcgtg 
ggaatacctg 
cgagccggcc 
catcgagttc 

gggcgagatc 

ccgggtcgat 
actcaccgga 
cgcgaagtac 
ggcgctgcct 
gaacgaggtc 
gctcgcgacc 



gcaggccgta 

cacgaccggc 

gttgccggac 

gattggcaag 

ggagttcgtc 

gctcggtctc 

gctcgcggcc 

cacgttcttc 

ggcccctgcc 

ggtccgccca 

atcgccgccg 

tggttcctgg 

ctgaagggcc 

cacgagacgc 

ccgacgctgc 

gaggccggat 

ggcccgctgc 

accgtgcatc 

gaggcgtacg 

tacgcggact 

ttgaagcact 

cgtccgcgtc 

gcgatctggc 

cttctctccg 

ggcagcccgt 

aacacgttcg 

cgggcgcgcc 

gtcgagacge 

gcgcaggatc 

gtccacaacc 

gtcaccctgt 

ctgcagcaat 

gagctgcgtc 

gcggccgtct 

actccggcgg 

cggcaggcga 

gtcggcgtgg 

gccggcggcg 

ctcgccgaca 

tcctttcccc 

gatctgccgc 

actggtcagc 

acagactacg 

gacgcgttga 

cccgagcggc 

tgcctgctgc 

gtcggcgtga 

gccctggcgc 

accatcgcgt 

atcggccggc 

ccgttcggca 

aatcagccgg 

gcccggctgt 

cttggccgga 

gagcatgccc 

gcccacatgg 

accgacctgc 

gtgttcctgg 

gcatcggcga 

gaagcggagc 

gacaattttt 



gtcgtgctgc 

ggcggcgcct 

tacatggtgc 

atcgatcgac 

acgcccgcca 

gagaagattg 

cgcaccgcca 

gaaatgccca 

tcccggacca 

tgaaccccgg 

aagcgggcga 

cgcagctcga 

gcctcgaccc 

tgcggacgac 

gtattccgat 

ggcgccgcgt 

tcaaggtcgt 

acatcgtctt 

cggctgagct 

ttgccgcgtt 

ggacgcagga 

cgcccctgca 

cgggcgtcgc 

ccttcggcgc 

cggccggccg 

tcctgcggct 

gcgcgtgccg 

tcaacccgga 

cgccggcgtc 

gcagcaccaa 

acgtcgagta 

atcgccgtgt 

tgcttggcga 

acccgcgcga 

cagtggccgc 

accgcctggc 

cgctcgaccg 

cgtatctccc 

cgcgcgcgcc 

tcttctgcct 

cgcagtcgac 

ccaagggcgt 

tccgctggaa 

cgttcgaggt 

tcgtcagcct 

tcacgaccgc 

agcaactcct 

acctgccgga 

gcgcctacca 

cgattgccaa 

tcgccggcga 

cgctcactgc 

accgcacggg 

tggatcgcca 

ttcgtgcccc 

cccatctcgt 

gcgaacgtct 

ccgaggtccc 

tcgaggcgcc 

tcgcgcgaat 

tcgatcgtgg 
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Figure 5b 

CDS complement (3 . . 914) 

/blast score=2e-66 

/blastp — matchs"AE004644.PA2177 Pseudomonas aeruginosa" 
/genes "regulator hybrid" 

/note= "probable similarity to prokaryote sensory 
transduction proteins" 

/product^ n sensor/ response regulator hybrid" 
CDS 924. ,2168 

/note = "none" 
/gene= " ligase n 

/blastp_matchs«AP003013.MLR8297 Mesorhizobium loti" 
/blast_score=e-152 

/product = " 2 -amino- 3 -ketobutyrate CoA ligase" 
CDS 2207.. 3190 

/blast score=e-151 

/blastp_match="AE008872.TDH Salmonella typhimunum" 

/note="none" 
/gene= "dehydrogenase " 
/products "threonine 3 -dehydrogenase" 
/pfam_matchs"PF00107; adh_zinc; 1" 
CDS 3373.-4455 

/note= "putative ■ 

/genes "me thyltransf erase" 

/blastp_match="AE001866.DR0026 Deinococcus radiodurans" 
/blast_score=le-08 
CDS 4546.. 4959 

/blastjscore=2e-ll 

/blastp_match= n AP002997.MLIil617 Mesorhizobium loti" 
/gene- "unknown" 

/note= "pfam00263, GSPII_III, Bacterial type II and III 
secretion system protein, Expect = 7.8" 
CDS 5176.. 6192 ... 

/blast_score=5e-98 

/blastp_matchs"AF064070.PE20 Burkholderia pseudomallei" 
/gene= "glucose epimerase" 
/note- "putative 11 

/products "UDP-glucose 4-epimerase" 
/pfam_match=»PF01370; Epimerase; 1" 
CDS 6331.. 14043 

/note=»substrat AT, malonyl ; zinc depend dehydrogenase 
zinc dependent adenosine deaminase putative" 
/gene= "PKS i" 

/blastp_matchs"AP285636.WCBR 2547 Burkholderia mallei" 
/blast_scores0 . 0 
/pfam__match= "ketoacyl - synt » 
/p f am_mat ch= " ketoacyl - synt_C " 
/pf amjnatch= "Acyl_transf n 
/pfam_matchs«SAM binding" 
/pfam_rnat ch= " adh_z inc n 
/pf amjmatchs «pp-binding n 
CDS 14275.. 15408 

/blast_scorese-104 

/blastp_matchs»AF285636.WCBT Burkholderia mallei" 
/genes "acyl- CoA transferase WcbT" 



22 /35 
/note= "putative" 

/pfam_match="PF00155; amxnotran_l_2 ; 1" 

CDS 15436.. 16245 

/blast score=5e-ll 

/bias ti_mat ch= " TTDEFFMT . FMT T. thermophilic" 
/gene= " formyl trans f erase " 
/no t e= " evidence experimental" 
/product- "methionyl - tRNA formyltransf erase » 
/pfam match** "PF00551; f ormyl_transf ; 1" 
/pfam>atch=«PF02911; formyl_transf_C; 1" 
CDS 16287.. 17384 

/note= "putative n 

/crene= " glycotransf erase " 

/blastp_raatch=»AF285636.WCBD Burkholderia mallex" 
/blast_score=e- 99 
CDS 17427.. 18158 

/blast score=2e-82 

/blast£_match="AF285636.WZM Burkholderxa mallex 
/gene="ABC-2 transporter Wzm" 
/note= " putative » 

/pfam_match=*»PF01061; ABC2_membr ane ; 1" 
CDS 18248.. 18847 

/blast score=7e-61 . 
/blasti_match-"AF285636.WZT Burkholderxa mallex 

/gene="ABC-2 transporter Wzt" 
/note= "putative " 

/pfam_match="PF00005; ABC_tran; 1" 
CDS 18952.. 20346 

/note= " put at ive ■ 

/gene« " glycosylt ranf erase " 

/blast score=e-101 t 

/blasti_match««AF285636.WCBE Burkholderxa mallex 

/pf amjnat ch= w Glycos_transf_l " 
CCS 20442.. 21167 

/note= "putative n 
/ gene= 11 unknow " 

/nx^stp^^ Agrobacterium tumefaciens" 

complement (21164 . . 24301) 
/note="tranporter domain" 
/gene= "unknow" 

/blast score=5e-29 . . _ . - 

/blasti match="AE009122.ATUl658 Agrobacterxum t*^*^*** 
/prosite - match="PS00402; BPD^TRANSP^INN_MEMBR; UNKNOWN^.!" 
Cos complement (24351. .27023) 

/note="none n 

, /gene= "unknow" 

/blast_score=5e-29 

/blastp_match=«AP003581.AL*R0267 Nostoc sp» 
CDS complement (27806. .29686) 

/note="none" 

/gene="cell surfarce protein" 
/blast score=2e-34 
/blasti anatch» ?^0aA748. MA0851 . 2567 Methmos^cxna 

acetivorans" 

/product»"cell surface protein" 
Cos complement (29535. .30872) 



CDS 
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/note** "none" 

/gene="cell surface protein" 
/blast score-2e-36 § 
/blastp_matcb=: n AB010748.MA0851 Metnanosarcxna acetxvorans 

complement (30848. .32647) 
/note=" fragment" 
/gene= " O- antigen" 
/blast score-le-72 

/blastp_match="AF105060.RFBC Riftia pachyptxla" 
/product= n O-antigen biosynthesis protein" 
/pfam_match-"PF00535; Glycos_transfJ2" 
complement (32574. .35555) 
/note= "put at ive ■ 
/gene= "glycosyltransf erase" 

/blast score=7e-46 . 
/blastp_matcb=:"AE013462.MM2213 Metnanosarcxna mazex 

complement (35533 . - 36598) 
/blast score=7e-37 

/blastp_matcb«"AE013462.MM2213 Methanosarcina mazex" . 
/gene= "glycosyl transferase" 
/note="putative" 
complement (36516 . . 37400) 
/blast_score=8e-22 
/blastp_match="AP003581.ALR0267 Nostoc Sp" 

/gene= n ALR0267" 

/note= "putative ATP/GTP -binding protein, esterase fusn 
Sequence 37507 bIT"s531 A; 13507 C; 13011 G; 5458 T; 0 other; 



CDS 



CDS 



CDS 



gatcccacga 

gtcacgtggc 

gtcaccttgg 

tcggcccaga 

tgcagcagcg 

tagaagagcg 

gcgcccttgt 

tgcacgccct 

tcgcggcgct 

tcgcgcagcg 

cagccctcgt 

tcgcgcgcga 

gcgcgatagc 

tcgagcacga 

tcgccggcca 

gatgggcttg 

gagataattg 

tcgaggccca 

tgcaggtggc 

accacccggc 

cgtcggttcg 

cgcgcttcct 

ggctcttcga 

cctcgatcat 

acatggcgag 

tgatcgccac 

gcgagctggc 



tcggcgccag 

ggccctcggc 

cgaagcccat 

agcgcgtgcc 

catcgtgcag 

aaaagtgccg 

tccagctggt 

cgatcaggag 

cggtcaggtc 

ccgtgatgat 

cctcgaagcg 

tgtcctcctc 

ccttcaactg 

agatcgcgta 

gcccgctcgg 

gcatggctat 

cttgatgaag 

gggcctgtac 

cgcccacgac 

cctggtcgcg 

cttcatctgc 

cggcaccgag 

gacgctgctg 

cgacggcgtg 

cctcgaggcc 

cgacggcgtg 

cgggcgctac 



cggattgcgc 

ctccagctcc 

gtgccttcca 

gtccttgcgc 

ctcggcctcc 

tccgatgatc 

cacgtggccc 

gcgcgagcgc 

gcgcgtgacc 

gacgttggcc 

gccgtcctcg 

ggtgtagaac 

ttcggcgccc 

gtccgtgacg 

ggcgccgaag 

tagggtacaa 

atgccatttt 

aagtccgagc 

gtgctcaact 

gccgcgaagg 

gggacgcacg 

gacgcgatcc 

ggcgaggagg 

cgcctctcga 

cagctgaagg 

ttctcgatgg 

cacgcgatgg 



agctcgtgcg 

tccatgcgct 

ctcgcgtcgc 

acgcgccagc 

gggcggccgg 

tcttcggcgc 

gcggggtcca 

tcctcgcttt 

ttgccgaagc 

cagaagcgcc 

cgtgcgcgca 

gcggagaaat 

gcgttccacg 

gacccgacca 

gcgaggttgc 

cggtgcacgc 

tggaattctt 

gcgtgctcga 

tctgcgccaa 

agacgctcga 

ccgtgcaccg 

tgtacggctc 

acgcggtgat 

aggcgaagcg 

cggcggacgc 

acggcatcgt 

tgatggtgga 



cgagcatcgc 

tgcggtgcga 

gaagcgccgt 

ccgtgtcctc 

ccgcggcatc 

tgtagccggt 

gcatgtagat 

gccgcagctg 

cgaccagcgt 

ggccgtcctt 

gcagctcctt 

ggcggccgat 

aacgcacgta 

gcagccggaa 

gcaccgaggg 

aagttgcaat 

acgcagggag 

ctctccgcag 

taactacctc 

gcgcgacggc 

cgagctggag 

ctgcttcgac 

ctcggacgcg 

cttccgctac 

ggccggcgcg 

cgcgaacctg 

cgactcgcac 



gaggaactcg 

gaggtcgcgc 

cacgacgacg 

ggcgcggcgg 

ctcgggcgga 

gatgcgctcg 

cgcgtagtcc 

ctcctcgtgc 

gccgtcctcg 

gcgcacgcgc 

ggcgggccag 

gatctcttcg 

gccctcggga 

gatggcgtcc 

ccgcggaccg 

cggacaat ca 

ctggggacgc 

ggcgccagcg 

ggcctggcca 

tacggcatgg 

acgcgcctcg 

gccaacggcg 

ctgaaccacg 

gacaacaacg 

cgctacaaga 

ggcgccatct 

gccgtgggct 
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tcat gggcgc gcacgggcgg ggaacgcccg agcattgcgg tgtcgaaggg aaggtggaca 
tcctcaccgg cacgctcggc aaggccctgg gcggcgcttc cggcggctac acggcgggca 
agcgcgagg? ggtggcc?gg cttcgcaacc gctcgcgccc ctatctcttc tccaacacgc 
SSSoSo SSoggge gcgtcgctca aggtgctcga tctcctcgag gaogsmog 
agctgcgcgc gaagctcgcg cgcaacgccc gccacttccg cggcgagatg acgcgcctcg 
glttcacgct ggccggcgcc gaccatccga tcatcccggt gatgctgggc ^ggccccgc 
tcgcgaagga gatggcggac cggctgctga aggaaggcat ctacgtggtc ^cttctcgt 
ttcccgtggt gccccgaggg caggcgcgca tccgcacgca gatgtcggcc gcccatgaac 
cgaagcacgt cgatcgcgcc atcgccgcct ttgccaaagt cgggcgcgat ttgggagtca 
ttgcttgaga accctttcga agacgaagcg cgagcccggc atctggatgg tcgagtcgcc 
caagcccgtg gtcggccaca acgacgtgct gatccgcgtg aagaagaccg ccatctgcgg 
caccgacatg cacatcttca actgggacga ctggtcgcag aagaccatcc cggtgccgat 
qacgitcggc cacgagtacg taggcatggt ggaggccatg ggccaggagg tgcgcggcct 
gcaggtcggc cagcgcgtct ccggcgaggg ccacatcgtc tgcggccatt gccgcaactg 
ccgcgccggg cgccgccacc tgtgccgcaa cacgcagggc gtgggcgtga accgccccgg 
cgfgltcglg gactacctgg tgattcccgc ggagaacgcg ttcccgattc . ccgacgacafc 
ccccgacgag atcgcctcga tcctcgatcc gttcggcaac gcggcgcaca ccgcgctctc 
gttcgacctg gtgggcgagg acgtgctgat caccggcgcc gggccgatcg f^atSTSTC 
cgcggccatc gcgcgccacg tcggcgogcg ccacgtcgtg atcacggaca tgaacgacta 
ccgcrtggcg ctcgccacga agatgggtgc cagccgcgcg gtgaacgtat cgaaggaaaa 
cctgaaggac gtgatgcgcg agctcggcat ggtcgagggc ttcgacgtcg gcatggagat 
gtcfggcgtg Icctcggcct tccgccagat gctcgacacc atgaaccacg gcggcaagat 
cgccatgctc ggcatcccgc ccagcgaggc cgcgatcgac tggacccagg tgatcttcaa 
gggcctggtc atcaagggcg tctacgggcg cgagatgttc gagacctggt acaagatgat 
SlcatStg cagagtggcc tcgacctctc gcccatggtc acgcaccgct ^gacgtgcg 
cgagtacctg aagggcttcg agacgatggg ctccgggaaa tccgggaaag tcgtcctgtc 
gtgggattag cgagggccac ggatcgcggc ccggttcccg cggcggcgat <**9ca&ra 
ggltggcagc acgccccggg atttccgccg tggctcacgt gcgacttcgc cgtcgacgca 
ccgatcgcga tcgacgtcga cgtgagcgtg fccgctgctcg atgcccaggg cacctcgctg 
tggacgcagt cgatggccgg cagcgggcgc gacaccccgt ggctaccgac cggcacgtat 
cgcgcctccc tgcgcctgga gccgttccgc ttcccggccg cggccgcctc ggtcgagttc 
gcgctctcga tgcgggtgca aggtgagcgc cgcgtcgtgg ccaccgcagc 9SSrcgt9Sfer 
cccgcgggcg caccccacgg cgtggcgcgc gcggcctggc acctcgaggc cctggatggc 
acgccggcgc tcgagacgcfc ggcctggtcc gatcccacgc acagctggtt ctcccagcafc 
ttcgaccatg cgtcgegcac gtgcgtcgag tacctgggcg ccggatggcc cgggtggcgc 
gggcgcgtgc tggatgcggg atgcggcgac ggcatcacgg cgctcggcat cgcgctgcgc 
tatgcgcccg agcaggtggt gggcatcgat ccggggcgct gctaccgggt gcttcccgac 
atcctcgagc gccacgccct cggccacctc gcgctgccgg acaacctgca gttcctcccc 
gcggatgcca acgcgctccc gttcgccgac agcagcttcg acgtcatcgt ctcgtggtcc 
gclgtcgagc atttcgtggg cggctacctg ccctcgctcg ccgaggcgcg gcgggtgctc 
aagcccggcg gcctgctggt catccatccc gagctctact acaccgcgca ccacggccat 
cacctcggcg agtacagccg cgagcccttc ttccacctgg tgaagtcgcc cgacgaggtg 
cgcgacatcg tgttcggcgc ggacgtgagc ctccacgacc gcggcggcat cgcgccgacg 
cgcgcggagc actggcgctg gtacaacgag ctcaatcgca tcacggccgc gcagttcgag 
gacgagttgc gcgcgctcga cttcgagccg tggcgcctgg cactgcgcgc cgagccgctg 
gtggagtaca agccggagct gctgcgctac cgcttcacgg agctgggcgt ctccgagctc 
tacgtggcct gcatcaaccg caagcccgtg tcagatggtt cctacaggac gcatgggcag 
gcggcattcc ggtaaggtcc gccgggacgg gaattgaagg cgcaagcgcg ttgttatccc 
cgtcactccc ggactgccgg gatcaccctc acaggagaac ccccgatgaa gacccagacc 
caggccgtgc tcgccgcggc cctgctgtcc cttgccggca ccacgctggc ccgcgccgaa 
gccgtcacgc tgacgggcgc gagcgaagtc ccgcccgtca cctcgagcgc gaagggcagc 
ggcacggtgg tcgtgaaggc cgattgcacg gtcaccgcga agatcaccgt caccggcatg 
accgccaccg cggcccacat ccacgaaggc aaggccggcg ccaacgcggg cgtggcagtc 
ccgttcgtga agaccgcgga- cacacggttcgaggcagccc.cgggqg.ccaa gatgaccgag. 
gcgcagtgcg ccgcgtacaa ggcgggcggc acctaogtga acgtgcacag cgaggcccac 
aagggcggcg aggtccgcgc ccagctggcc ggcaagtagg gcagccgctg caaaaagaaa 
aagcccgcgc aagcgggctt tttcgttgaa ggctttggtc ggggcgaaag gattcgaacc 
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ttcgaccccc 
tgaaattata 
tagcatccgc 
atcctctgcg 
agctcgccgc 
gacgtggaca 
agcgtgatcc 
taccacgaca 
gtgcgcaagt 
atcgacgagg 
gagcacatgc 
tacttcaacc 
cccaacaacc 
gtctacggca 
tgcgacctgg 
accgtcctca 
accttcgagc 
gacgtcgcga 
aagcgcggaa 
ggctacggtt 
agggggcggt 
ccggcgttgg 
ccgagtaaga 
agcaacacct 
cgcgaatccc 
gcgcgctggg 
tcgttcgccg 
tccccgcgcg 
gaagccctcg 
ttcgtgggcc 
gacgcgtcgt 
tacgacctcc 
ttccaccagg 
atcagcctgc 
aagcgcggcg 
gcgggcgtgc 
gccgtcgtcg 
agcgtcgatg 
tccgagatcg 
acccgcgccc 
ggttcggtga 
aaggcgctgc 
aaccccaaca 
catccgtaca 
cacgtgatcc 
gtggcgctca 
gcgaacttcc 
cactgccgcg 
gccggcgccc 
gcgctgccca 
gcaatggggc 
gacgcgctgt 
cccgccgacc 
ggcttgaccg 
gtgggcgagg 
gtgatctacc 
gtcggcctcg 



tgcaccccat 
cctgcatcgg 
ccatggcgac 
cactggccca 
gttccatcga 
tccgcgacgc 
acctcgcggg 
acaacgtccg 
tcgtcttcag 
acgcgcccac 
tggtcgccct 
cggtcggcgc 
tcatgcccta 
gcgactaccc 
ccgaaggcca 
cggtgaatct 
gcgtgaacaa 
tctgctacgc 
tcgaggagat 
gacggattgc 
tcttgtaaga 
aaggaagaag 
atcgtcagtc 
cgcatcgcgt 
tctggccctc 
cgaaggagcc 
ccggcaccafc 
aggccgcgca 
aggccgcggc 
tgtccaccgt 
cggccaccgg 
acggccccag 
cctgccagtc 
acctgcatcc 
cctgccgcgt 
tggtgctgaa 
ccgcgagcgg 
cccaggctgc 
actacctcga 
tcggcatcgc 
agagcaacgt 
attgcctcaa 
tccacttcga 
agaagctcgt 
tcgagagcac 
tgctgagcgg 
tgcaggcgag 
actggcacca 
tcgcgcaatt 
acgcgtccgg 
gcgccctgct 
tccgcccgct 
agctggaccg 
agctgctgcg 
tcgccgccgc 
accgcagccg 
ccgaggacgc 



gcaggtgcgc 
ggtcgcagcc 
gaccctcgtg 
ggccgggcga 
acgcgtgctg 
cgacggcatc 
actgaaggcg 
cggcaccgag 
ctcctcggcc 
ctcgccgcag 
cgccaagcgc 
acacgagagc 
cgtgtgccag 
caccccggac 
cgtggccgcg 
cgggacgggg 
actaagcatt 
gaacgcgggc 
gtgccgcgat 
aaatcatggc 
tccaaggttc 
gcaaccctag 
gagaccaact 
cgccgtcgtc 
gctcctcgag 
gtactaccac 
cggcgacatc 
gatggatccg 
ggtgaagccc 
cgacfcacagc 
caacaccgcg 
catggtcgtg 
gatcgccatc 
gtacgggttc 
gttcgacgcc 
ggacctcgac 
cgtgaatacc 
attgctgtcg 
ggcccacggc 
gctcgggcgc 
cggccacctc 
gcaccgcgcg 
cgagtggaac 
cgtcggcgtg 
gcccacttcg 
ccacgacgac 
cgagcaaccc 
gctccgcgcc 
cgccaagggc 
ccccgcattc 
cgccggcgag 
cggcggctac 
caccgagatc 
ccactggggc 
ttgggcgagc 
ctaccagggc 
cgcgcgccgc 



taccaggctg 
gccatgaafcc 
acgggtggtg 
cgcagcatct 
cagatcgcac 
cgcaaggtga 
gtcggcgaat 
agcctgctcg 
accgtgtacg 
agcccgtacg 
gacccctcct 
gcgctgatcg 
gtcgcctccg 
ggcaccggca 
ctggaagcac 
ctcggctaca 
gcccggcgca 
cgcgcgaagg 
gcctggcgtt 
cttggcgacc 
gggatgcgtc 
tgttgtcatg 
cgcctccggg 
ggcattgcgt: 
gggcgcaacc 
ccgcgcaagt 
agcggcttcg 
cagcagcgcc 
tcgaccctgc 
tacggcctcg 
agcatcgcgg 
gatacggcgt 
ggcgagacgc 
atcgcgttct 
agcggcgatg 
cgcgccgtgg 
gcgggccgct 
gatgtgtacg 
accggcaccg 
ctgcgcccgc 
gaggcggcct 
gtgccgccca 
ctgaagccgg 
aactcgttcg 
cccaaggcgg 
gcggcgctgc 
gcgctgcacg 
atggtggtcg 
cacaacacgc 
gtgtattcgg 
ccggtgttcc 
tccatcctcg 
gcccagccgc 
atccagccca 
ggcgcgctga 
gagaccaagg 
cagatcgagg 



cgctacgccc 
gaaacgcccc 
caggctacat 
gcatcgacaa 
ccggttgcgt 
tcgcagggcg 
cggtcgagca 
cggcgctcgc 
gcctcgccga 
gccagaacaa 
ggcgcgtagt 
gcgaggaccc 
ggcgactgaa 
tgcgcgattt 
tcgagcgcgc 
ccgtgctgca 
tcgtggggcg 
ccgtgcfcggg 
ggcaggaaag 
cttgctgcaa 
caggacgtca 
gctccagacg 
tgcaacgaat 
tccgccttcc 
tcgtcaccac 
ccgagcccgg 
acgcgtcgtt 
tgctgctcga 
gcggcagcga, 
cctacgacct 
ccaaccgcct 
gctcgtcctc 
gccaggcgct 
ccaaggcgtc 
gctacgtgcg 
ccgacggcaa 
cctccagcct 
agcgcgccgg 
tggtgggcga 
gccaccagcc 
cgggcatggc 
ccatccacct 
tcctcgagcc 
gcttcggcgg 
tcgaggcccc, 
gcgacgtggc 
acatcgcgta 
gcaccgaccg 
ccggcgtcgt 
gcaacggcgc 
gtgccgccgt 
agatgctgga 
tgctcttcgc 
cgcgcgt cac 
cgctcgacga 
gcgcgggcgc 
cgctcgggtt 



cgaccgaact 
gggcttcggg , 
cggcacccac 
ctactccaac 
cgaggccttc 
cgacgtggac 
gcccgagcgc 
cgactcgccg 
gaagatgccg 
gctcgacatc. 
gaacctgcgc 
ggccggggtt 
ggagctgagc 
catccacgtc 
ggcccccggg 
attgctcgaa 
gcggccgggc 
ttggaccgcg 
gaatgcaaag 
tgcggcatga 
aaaacagcac 
ggattgcgcg 
cgtgagccga 
cggcgcfcacg 
cgtcgatccc 
caccagctac 
cttcggcafcc 
gctctcctgg 
ttgcggcgtg 
ggcgtccatg 
ctcgtatttc 
gctggtggcg 
ggtgggcggc 

gatgctctcg 
ctcggaaggc 
cccgatcctc 
caccgtgccc 
catcgacccg 
tccgatcgag 
gctgccgatc 
gggcctggtg 
cgattcgccc 
gctcacgctc 
.cgccaacgcg 
ggccggcgag 
ggcctcgtac 
caccaccgtg 
,cgccagcafcc 
gagcggccgt 
gcaatggccg 
gcaatcggtc 
gcaggaggtc 
cgtgcaggtg 
cggccacagc 
fcgcggcgcgc 
gatgaccgcc 
gaagggcgag 
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5400 
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7080 
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7200 

7260 

7320 

7380 
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8280 

8340 
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gtgaacgtcg cgtgcatcaa cagcacgcgc aacgtgacgc tcgccggcac gcgcgccggc 
atcgaggcgc tcgaggccga gctcacgcag cgcaaggtgt tccaccgccg cctcgacctc 
gactacgcgt tccacagccc ggcgatggac ccggtgcgcg acgggctcgt gagcgcgctg 
cgcggcctca cgccgcgcgc cacccgggtc gcgttccatt cggcggtcac cggctcgccc 
gccgcgggca accagctcga cgccacctac tggtggagca acatccgcga gccggtgcgc 
ttccaggccg cgatccgcgg catcgccgag tcgggcgtca atgtcttcat cgagatcggc 
ccgcacccga tcctcaagaa ctacatcaac gacggcotgc gcgccgcctc gatcgagggc 
cgcgcgctgg tcacgctgca gcgctcggcg agcgatcgcg cggcgatccg cgccgccgcc 
caggaagtcg tgatcaccgg ctgcccggtg gagaccgcga agatgctccc cgcgcagggc 
cacttcgtcg agctgccgcc ctacccgtgg cagcgcgagc gccactggcg cgcgcccacc 
tcgcaggcct acgacctcat ccagcacggc aagcagcacc cgctgctggg ctatcgcctg 
cacgagaacg acttccagtg ggaaaaccac atcgacaccg cgctctatcc ggcgtacgcg 
gaccatgtcg tcggcggcgc cgtggtgttc cccgcggccg gctfccgtcga gatggcgctc 
gccgcctcgg cgatcgcgct gggcggcgac gcgcacgaga tcgagacgct cgagatccgc 
agcgcgctgc tgctcgagga ctccacgtcg aagaccgtgc gcttcgcgct ggaacccgat 
ggccgcttca caatccgcag ccgcgctcgc ctgagcgagg atccgtggca actccacgtg 
gtcggcaagc tggtcggcac gcccaccgag cttccgcgcg cgccgcgcca gtcggcgccg 
tcgcgcaagc ccgatgtgat cgcggcccag cactacgaag gcgccgcgaa ggccggcctc 
gcctacggcc cggccttcca gtcggtctcc aaggtgtggc tcgacgccgc ggacccgggc 
agcgcctacg cgcgcctgat cctccccaag ccgattcgcg gcgagctggg cgtgatgcac 
ctgcaccccg cgfccgctgga cggctgcttc cagctgctgg tcgacctgct gcgcgccgag 
gcctcccggc acgcgcagat cgccttcgtg ccgatcctgg tcgggcgcac gcgcctctac 
ggccctgccg ccctcgtcac cgcctgccgc gtccgcctga ccgcgcgcag cccgcgctcg 
ctggtcgcgg actfccgagct ctacggcccc aacggcgacg tggtggccgc gctcaccggc 
gtgcgcttcc gcggcgtcca gctgaagcag ccgcagggcg cgcgcctgcg ctacctgtcg 
taccgcgcca tcccgcgcct gaatggcggc gacacgccgg gcgccaagcc gctgcccctc 
gcgttgctgg ccgcggcgtg cggcgagcgc tggcattcgc tctcggccgg ctcgcgcaag 
cgctactacg atgaggtcga gccgctctcb gacgtgctgt gcagctcctt cgccgagcac 
gcgctgcgaa agcggatcgg cgaggccccg ctcatcgacc cggccgcgct cgccgcgtcg 
acgccggccg agcattggcc gctcctcaag cgcgcgatcc agatgctggt cgaggaccag 
ttgctcgagc ccgccgaagg cggctggcgc tggagcccgg tcaccgaatt gcccgacgcg 
caggagagct ggatcacgct gctgcgcgac tacccggacg aggccggcca gctgatcgcg 
ctcgggcgcg ccggcgctca cctcgcggaa gtgttcgccg gcaaggaagc gtccttgccg 
agcctgttgc cggccacgca ggcgaacgtg ttcgcgcgcc tgtgctcggg cgcacccgcg 
ttcgccaacg ccgggcgcgc catcgccgac acgctggcgc tcgccctcga gcgcctgccc 
gcgcaccggc cgctgcgcgt gctcgaggtg acgccgtgcc gctccgagat cacgctgcgc 
gtcctcgagg ccgtggacct cgaccgctgc gagtaccacg tcggctgcac cacggacgaa 
gcactgggcg agtacgaagg cgtcctcgat gccattccgc acgtcgagac ttgcgtcgtg 
gacctgaaga tcccgggcat gggcctgaag gcgccgaacc agggcccgtt cgatgtcgtg 
atcgtttccg atggcctgct cggtgcggcc gatccggatg cggccgcggc gcacctcgcc 10860 
ggcgcgctcg cggaagacgg gctgctggtc atggtcgcgc agcatccgtc gcgctggacc L0920 
gacctcctct tcggcgcgga cccggcgtgg tggacgctcg ggccctcggg ctcgcagcgc 
tcgcgcctgc gttcgccgga cgagtggcgc atcgcgctgg cccacctcgg cttcgggccc 
acogtcgcgg tgcccgagct gcccggcctc gagcgcggct cgtacgtcct catggcgcgg 
cgcgacggcc cggtgccgca ggctgccgag cgcagcgttc ccgccaagtc gggctggatc 
ttcgtgcagg aagcgggcgg ttattcggcg gcgttcggcg cctgcgtgca ggcggatctg 
cgcgcacgcg gccacgcggt gttcgagatg gcgcccgatt ccgtcgaggg ggccgagatg 11280 
gcgcagcttg cgcgcgcgcg ccagaogctg ggcgacatca gcggcatcgc cttcctggcc 1134 0 

ggcctgcccg acgacgcggc cgccacgggc gacgcggccg agcgcctcgc gcgccagacc 
gcgcgctgcg ccgcgctcgc gcgcctgctg cgcgaatgca aggcgagtgc catgacaccc 
gcggtgtgga tcgtgacctc gggcgcggcc tccagcctgg cgcgcaacgt ccccgccctt 
gcctcgcgcc gccgccctcc cgatccggac gaggcgatcg tatgggggtt cgcgcgcacc 
gcgatgaacg aattecccga gctgcgcctg cgcctcgtgg actgcccgaa cccggagaac 
ctcgagcgca acgccgcctc gctggtgcag ..gagctgctct cgcccaccgc cgacgacgag 
gtggtcctca ccggcgccgg ccgctacgcg ctgcgcctgg gcgcggtgcc gccgcccgca 
tccgtgcgca ccgctaccca gggcatcacg cgcctcgact tcgccgcgcc cggcccgctg 
aagaacctcg cgtggcgcgg cgacgtgcgc cgcaagccgc gcgcgaacga ggtggaaatc 
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gaggtgcgcg 
gacgaggcgg 
gtcaccgccg 
gcgccctcgg 
gccggctgga 
tcgttgaagc 
ggcggcgtgg 
accgccggct 
gactcgcgca 
gacgtggtgc 
ccgttcggcc 
ctgcgcccgt 
gagcggcccg 
gtgctctcgc 
tacatgcagc 
gcgcagcccg 
ctcgtcaccg 
ggtgcccgcc 
gcggtggccg 
gaccgcgcct 
ggcgtgatcc 
cagatccgcg 
cgcggcgcca 
ccgggccagg 
cgcgtcgagg 
ctcgcgcgcc 
gagtcggagg 
ggcgtgctcg 
cgcttccgcg 
gagctgcgcc 
ctgctgcgcc 
cagcccttgc 
gaggcgcgct 
aagctggtcg 
tcgcgcagcg 
caggtcgagg 
tgaaggcgcg 
tgctcgagca 
tcaacgcctt 
tcgacatgca 
gcattcccga 
ccgggcgcga 
ccgtggatgc 
aggctcgtgt 
tatggcgtgc 
ggccacctgt 
ctgatgggca 
gccctggacg 
gagggcgtct 
cgccgccacc 
cgcggcttcg 
ggcacgctct 
gtcgagcacc 
gtggcggccg 
gccctgcagg 
ggcctctcgc 
cgcctggccg 



ccgcgggcct 
tcgagggcgg 
tcggcaagga 
cgttcgccac 
cgttcgagtc 
acctggccca 
gcatcgccgc 
ccgacgagaa 
cgctcacgtt 
tgaactcgct 
gcttcatcga 
tccgcaacaa 
acctcgcgcg 
cgctgcccta 
actcgcgcca 
cccggcccgc 
gcggcctgtc 
acctggtgct 
acctcgaggc 
ccgtgcaacg 
acgcggcggc 
acgtgctggc 
agctcgattt 
ccaactacat 
gccagccggc 
acgagaaggt 
ccgcgctcgc 
acttcagctg 
agctgggcgc 
gcctcgccaa 
gcgaggtggg 
aggaaatggg 
tcggcgtgaa 
accgcatcgt 
accagatcga 
agctcacgca 
caacctccgc 
gcgcgtgcgc 
ccgccccgac 
cccggtctac 
tccgtacttc 
gtacatcaac 
cgcggccaag 
ccggcgagcg 
acgacgccgt 
tcgggccgcg 
tccagctctc 
agctgctgcg 
acagcatgga 
gcgcgttcct 
gcatccgcga 
ccaagacgct 
tgaagttcgc 
cggcgctcgc 
cgcgcggacg 
agggcatggc 
aggcgatgtt 



caacttccgc 
cttcgccggc 
cgtgacctcg 
gcacgtggtc 
cgcggccacc 
gctgcgcgag 
gatccaggtc 
gcgcgacttc 
cgccgacgac 
ctccggcgag 
gctcggcaag 
catctcgtac 
ccagctcttc 
ccgcgccttc 
gatcggcaag 
cgtgccgcgc 
gggcttcggc 
ggtctccaag 
ccagggcgcc 
gctgcfccgcc 
cgtgatccag 
gcccaaggtc 
cttcgtgctg 
cgcggccaac 
gctgtgcgtg 
gaaggaagcc 
ggtgctggag 
gcgcaacctc 
gcgcggcgac 
cgagctggac 
cgagatcctg 
catggattcg 
cctgccggtg 
gcgcgcgctg 
gcgcgtcgcc 
ggcggtcgat 
ggcctctcgc 
cgggtc^aga 

cccgatgccc 
cagcagatgc 
cgcgcgcacg 
ttctccagct 
caggccatcg 
gccgctgcac 
ggccttcatc 
cgacctcatc 
gggcgccaag 
cgcgcagcgc 
cggggactac 
gatggtggat 
ccacttcggc 
cgcctcgtgc 
cgcccccggc 
cgcgctgaac 
cttcttcctc 
cgtggtgccc 
ccagcgcggc 



gacgtgatgt 
gcctcgctcg 
gtcgcgcgcg 
accaccgccg 
atcccgaccg 
ggcgagcgcg 
gccaagtggc 
gtgcgcctgc 
gtgctgcgca 
gccattgcgc 
cgcgactact 
ttcggcgtcg 
aacgagctga 
gccgccaccg 
gtggtgctgt 
gagctggcgc 
ctgcgcaccg 
agcggcgccg 
acggtgatcg 
gaggtcgcgg 
gacggcttca 
ctcggcgccc 
tactcgtcgg 
tgcttcctcg 
ggctggggcg 
ctgcagacgc 
cagctcctgc 
cgccgcttcc 
gacgcgaagg 
gcctccgagc 
cgcatcccgc 
ctgatgggcg 
atgtcgctct 
aaggacccga 
gcgcagtacg 
gacgcgcaag 
aggcagtgaa 
aggacaagcc 
tgggccacat 
agctggtgaa 
acggcgtggc 
acaactacct 
acgaatacgg 
cgcgagctgg 
agcggccacg 
gtgcacgacg 
cgcatggcct 
cgccacttcg 
cccgagctgc 
gaagcgcact 
atggccggcg 
ggcggcttca 
ttcctctaca 
cgcatgcgcg 
gaggccgcgc 
gccatcaccg 
atcaacgtcc 



acgcgatggg 
gcatggagct 
gcgacgaggt 
attcggtcgc 
cgttcttcac 
tcctgatcca 
tgggcgcgga 
tgggcgcgga 
tcaccaacgg 
gcaacctgcg 
acgagaacac 
atgccgacca 
tgcagctctt 
aggcggtgga 
cgttcgccga 
tgtcgccgaa 
cgcagtggct 
agtcgctcga 
cggaggcgtg 
ccgcgctgcc 
tcaccaacat 
gccacctgga 
cgacgacgct 
aggcactcgc 
cgatcgggga 
acatgggcgg 
tcgccgacgc 
tccccaccgc 
acgacgacga 
tctcggcgat 
cggaccgcct 
tggagctgct 
ccgagcagcc 
acgccggcac 
cgcccgagct 
ccacgcagca 
ggagcagctg 
ctcggcggtc 
ctcggagcgc 
ggaaggtgcc 
cgcggatacc 
cggcttctgc 
gaccctcggt 
agcgcgccat 
cgaccaacgt 
cgttcgtgca 
tcccgcacaa 
agcgcgtgct 
ccgaattctc 
ccttcggcgt 
acgaggccga 
tcgccggcga 
gcgtcggcat 
aggagccggc 
gcgccgcggg 
gcagctcgat 
agccgatcgt 



cctgctctcc 
ctcgggcgtc 
gctcgcgttc 
gaagaagccc 
cgtgtactac 
cggcgccgcg 
gatcttcgcc 
ccacgtgctc 
cgagggcgtc 
cgcgctgcgc 
gcacatcggc. 
gctcatgaag 
cgagcagggc 
agccttccgc 
cggcgtgaag 
cgccacgtac 
ggtggacaag 
gaccaaggcc 
cgacatcacc 
gccgctgcgc 
gacggccgcg 
cgagctcacg 
gttcggcaac 
caaggcgcgc 
cgtgggctac 
caccgcactc 
ctcgggcctc 
cgcctcgccg 
ccggcgcggc 
gttcaccgac 
cgatacgcgc 
gaccgcggtg 
gtcgatcgag 
cgaggccgaa 
cgactcgcgc 
gggacgtccg 
atccagcgcg 
gagtccgcca 
tactgccgct 
gcgaagctgg 
acgcgcattg 
ggcgac ccgg 
ctcggccagc 
cgccgaggtc . 
ctccaccatc 
caacagcatc 
cgactggcgg 
gatcgtgatc 
gcgcctgcgc 
gatgggcccg 
catctggatg 
gaaggcgctg 
ggccccgccc 
gcgcgtggcc 
catcgacgtg 
ccgcgccggg 
ctacccggcc 
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gtgcaggaga actcggcgcg cctgcgcttc ttcgtgagcg ccacgcacac cgaggagcag 15360 
ctgcgcttca cggtccgcga gctcgccgac gcctggcgca agctctgagt ggcgggcccc - 15420 

cggctgcgcg tcgtcgtctg cacccacggc ggcctgaacg gcgcgctggt gctctcgcgg 15480 

ctgctggccg cccccacgct cgaagtctcc gcgctggtga tctccagccg cgcgcgcggt 15540 

gcccatgagt cgtatggccg tgccgcgctc ggatacgtgc gcgcgagcgg cgtcgcctat 15600 

gcgctgtacc tctggtgcgc cacggcgctc gccgacctgc tgctgcgcgg cacgtccgag 15660 

ggcccggtcg cgcgcatcgc cctcgcgcgc ggcatcccgc tcctcgccac gccgcgcgtg 15720 

aacgatgcca ccgcgcgcgc cttcafccgcg ggggcggcgc cggacctgat cgtctccgcg 15780 

ttcttcaacc agcacatcga cgccgacgtg gctgcgctcg cgcgcgtggc cgcggtcaac 15840 

atccacccgt cgccgctgcc gcacttccgc ggcgtggatc cggtgtcctt cgcgcgcctg 15900 

cgcggtgccg agcgccacgg cgtgagcgtg catcgcatcg aacccggctt cgataccggc 15960 

gcgctgctcg cccaggaaac cgacgtcgag gcccccggca gcgtctttgc cgccaccgcg 16020 

gcgctctatg accgcggggc ggcgctcctc gccggccgtg ccgcggcgct ggccgccgac 16080 

ccgcgcggaa ccccccagcc ggcggggggc tcctacgact cctggcccac ccgcgcccag 16140 

gtcgccgcct tccggcgcgc cggcgggcgc ctgctccgcg cccgcgacct gfcggcgcctc 16200 

gcgcgccggg ggccggccgc tttcgtaata gaatcagcgc ggtagcggcg actcacgccg 16260 

cgcttttccc cgaccccccg aggcccatga aacactggct gaagcaacac gcaatcttcg 16320 

cgcfccaccgt cctgctgccg accgfcggccg cgatcctcta cttcggcctg atcgccfcccg 16380 

acgtctacat ctccgaatca cgcttcgtgg tgaggagccc ccagcgccag gtgcagaccg 16440 

gcctggtggg cgccctgctc tcgggcaccg gcttctcgcg ctcccaggac gacacctact 16500 

cggtgcacga cttcatcacc tcgcgcgacg cgctgggcga gctggacaag aagctcgccg 16560 

tccgcaagct ctacacggcc gccaacatcg acttcatcaa ccgcttcccg gggcfccgact 16620 

gggacgacag cttcgaggcg ttccaccgcfc actaccagaa gaaggtcacg atcgacttcg 16680 

acaccgcgtc ctcgatcacg gtgctgcgcg tgcgcgcctt cgagaaggcc gactcgcggc 16740 

gcatcaacga cctgctgctg cagatgggcg agcgcctggt gaacgagctg aacgagcgca 16800 

gccgccagga cctgatccgc ttcgcgcagg ccgaggtctc gctcgccgag gacaaggtga 16860 

aggatgccgc gctggcgctc tccgccttcc gcagcaacca gtcggtgttc gagcccgacc 16920 

gccaggcctc gatccagctg cagggcgtgg ccaagctcca ggaggagctc atcgccaccg 16980 

aggggcagct ggcacagctg cgcaagctct cgcccgacaa cccgcagatc ggcgcgctcg 17040 

agaacaagfcc ggcggcgctg cgcgtggcga tggcgcgcga gtccgccaag gtgaccggcg 17100 

gcagcggctc gttcagcgcg cgcgccccgg cgttcgagcg cctcaccctc gagaagggct 17160 

tcgccgaccg ccagctgggc gttgccctca ccgcgctgga gaccgcgcgc agcgaggcgc 17220 

agaggaagca gctctacctc gagcgcatcg tgcagcccaa cctgcccgac gaatcggtcg 17280 

agccgcgccg catccgctcg atcttcaccg tgttcgtgct gggcctggtg gcctggggcg 17340 

tggtgagcct gctggtggcg agcgtgcgcg agcacgtcga ctaggccgtg gagcagaacc 17400 

cttccctcct gcgcgccctc ggggtgcagc ggcgcgtgct ctacgcgctg ctgatgcgcg 17460 

aggtcatcac gcgcttcggg cgcgacgacc tcggcgtgct gtggctggtg gtcgagccga 17520 

tgatcttcac gctcggcgtg acggccctgt ggaccgcggc cggcatgaac cacggctcgt 17580 

cgctgccgat cgtggccttc gcggtcaccg gatactccgc ggtgctggta tggcgcaact 17640 

gcgcctcgcg cgcctcgatg gccatcgagg ccaacaaggg cctgctcttc caccgcaacg 17700 

tgcgcgtgat cgacgtgttc gtgacgcgca tcatcctcga gatcgcgggc gccacggcgt 17760 

cgttcaccgt gctgggcatc ttcttcafcct ggatcggctg gatgccgatg ccgatcgaca 17820 

tgctcaaggt cgcgttcggc tggttcatgc tgtcgtggtt cggcgccagc ctggcgctcg 17880 

ccatcggcgc cggcaccgcg tactcgctgg tggtcgagcg gctgtggccg ccgacggcct 17940 

acctgctgtt cccgctctcg ggcgccgcct tcatggtcga gtggctgccg gagaaattcc 18000 

agcagttcgt gctcttgctg ccgatggtgc acggcaccga gctgctgcgc gacggctact 18060 

tcggcaacgc ggtgcgcacg cactacgacg tgggctacat ggcaacggtc tcgctgggcc. 18120 

tgtcgctgct cggcatgcac ctggtgcgca tggcggccaa gcgcgtggag gccgcgtgat 18180 

ccgggtcgag aacgtgacca gatgtactca cgcgcctcgg cccagaacgt gctgacgcgt 18240 

cagcttcgag ctcggccgcg ggcgcaacat cggcatcctc ggcaagaacg gcgccggcaa 18300 

gtccacgctg atccgcctga tcagcggcgc cgagagcccc accaccgggc gcatcgtgcg 18360 

cgagatgagc gtgtcctggc cgctcgcctt cggcggcgcg ttccagaccc acctcaccgg 18420 

gctggacaac ctgaagttcg tctgccgcat ctacggcgtc gactaccgcg acaaggtcgc 18480 

. cttcgtcgag gacttcaccg agctcggcgt gtatttqqgc. gagccggtgt t„ccactactc X95.40, 

gcacggcatg atcacgcgcc tcgccttcgc gctgtcgatg gcggtggagt tcgactgctt 18600 

cctgatcgac gaggcgatgg tggtgggcga cgcgcgcttc cacgagcgct gccacgtcga 18660 

gctcttccac aagcgcaagg accgcgcctt catcctcgtc acccacgacg ccaaggtgat 18720 
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caagctctat 
cgtggacgcc 
tgcctgaacg 
ggccacgccg 
ggcatcgacg 
ccgcgcggcg 
gtctccacca 
cagctgctgg 
gtctggcagg 
gccggctttc 
cgcaagctcg 
gccgagacgc 
gacgcgatcc 
cactaccggg 
gccacgcgca 
cacaacatgg 
atcctgcgaa 
gtcgcacgat 
ctcgccgtgg 
cacctgcgca 
ggctacgagg 
ctcgaggacg 
tgcccgagct 
ccggtgatcg 
tgcagcccct 
gcgacgggcc 
cccgaagcca 
ccgtgagcga 
acttcggcgt 
gcgaattcct 
acatcgcgcc 
tcgtcatccc 
tgctggccga 
tctcggacga 
tgttccagca 
tggcgagcgg 
cgtacggcgt 
ccagccgggc 
gggcgccgga 
cggagatgct 
accacggcgg 
cgaaggagag 
ccgcgacttc 
agtagtgcac 
cgggcccgac 
cgaggggcag 
cgtcggcgag 
gcgaatcgcg 
gggtccagaa 
ccccggagag 
tccgcaacgg 
ccgagcgcgc 
cgggatccac 
gcgcgtgcag 
gctccagcgc 
aggtcgcctg 
gcgccgcggc 



tgcgagagcg 
gcctacgagt 
cccccgcgcg 
gcatcccgca 
tggagggcct 
gcggcgacct 
agcaggagct 
gcgcgcgcga 
cgctcttcgc 
gcgtcgcgcg 
gctacacgct 
cctacccggc 
cgctcctcat 
cgctgcgccg 
aggacctgct 
tctcgcacca 
cgcgcgcgag 
cgccctcctt 
cgacgatcga 
cctcgcgctt 
cgctggtagc 
tgcccgcccc 
tcggcgaggg 
cctcggagat 
attccgtggc 
tgcgccaggc 
tcctgccgca 
tccgaaggcc 
gatcaccaac 
cgcgtccggc 
cgacttccgt 
cttcgccgcc 
ggctgcgcgc 
ctcgctcacc 
catcgatccg 
cggcgtgggc 
ggcgccaccg 
ggagatcaag 
accggcaccc 
gttcctcgtg 
cgagctgggt 
gcccagcgcg 
ggcagccacg 
ccagtaggag 
gcgcccctgg 
gacggcgagc 
gcgcatcagc 
cagcatcgca 
cgaaggcgcc 
ccgggacgcg 
cgccagcacc 
gaactgcatc 
cgcgccgagc 
ccgcaggcga 
accctcgacg 
cgcgccgggg 
ggcgtcgcgc 



cctgcgtgct 
tctacatgaa 
gccgctgcgc 
ggaggcacgc 
gctgcagcac 
cgcaccggac 
caccaacgcg 
gaccctctac 
gcgcacgctg 
cgtgccgtgg 
gcatccgcgc 
gcgggtcgcg 
gccgcacacc 
caacgtggcc 
ctcggtcttt 
ctacttccgc 
cgagcgcatc 
cctcgcgcgc 
gccgcgcaag 
cccgggcctg 
caagttcaag 
cgagctgcgc 
cttcgatttc 
cgccgcgcac 
cgacctcgcc 
gctggtcacg 
gtggcgcgag 
accgaccagg 
ccgcgcttcc 
cgcgtccatg 
ccgcgcacca 
caggccgagc 
aactgcgccg 
gggctgccgg 
gcgcgcgggc 
gcgttgcact 
cccgagccgc 
gccgcgagca 
aatcccgacc 
caggaagccg 
ctcttcctct 
gtgatcgtcg 
cgctcgaggt 
tccggacagg 
tgcacttggt 
agcgcctcgc 
gtgggcaggg 
gcgtcgccgc 
gcggcatcga 
gcccacaaca 
gcggtggcga 
agcgattcgc 
gcgcgggcgg 
tggaacgcgg 
gcgccatcgg 
aggttcgtgc 
gcgccactga 



gcacgagggg 
cgaggtcatg 
gtgctcctcg 
ctgctgttcc 
agcggccacg 
cgccagctga 
cacgtggcga 
cgcttcgacg 
cacgccgcgg 
acgggcatgc 
atcgacactt 
gacggcacgc 
atctccgaca 
tcgggcgcgc 
cccgaggtcg 
gagaccgccg 
aagggcgcgc 
gccgcgcaac 
aaccacgcgg 
cgcctggtgg 
ccgtggctcg 
ctgctctact 
tccggcgtgg 
cgcgaggtgt 
gaggcgatcg 
cgcggcaccg 
tacctgctcg 
cgtgggagga 
gccgcgggca 
ccgactacgt 
tcctcgattt 
aggtgaccgg 
agcagggcgt 
gccccttcga 
gcgacatctt 
tcgtctatgc 
cgccgccccc 
aggcgcgggc 
ccgacatgca 
gcgtcacgcg 
tcttccgcaa 
cggcgtgctg 
cgccgcgcga 
ccgacgcgat 
gcgaggccca 
tcaccgccag 
actccggcgc 
gcgcgagcgc 
gcgcgcgcgc 
ggttcgcctc 
cctcgcgcga 
cgagcgacga 
cacgcagcgc 
cccactccgc 
tgcgcccgcg 
cttgcgcgac 
agtcgttggc 



cggctgctgc 
caggacctcg 
agctgcggcc 
gcggcctgcg, 
tgctcgccaa 
accgcctgtc 
cggcgctgat 
cggcgcactt 
acttcgactc 
accgctgcgc 
ccgatttcga 
gcctcgtggt 
tgtcctacca 
acttcgcgtg 
aggcgcgctc 
acgccgggcg 
ccaacggagc 
ccagcgaccc 
gactgatcgc 
tggtcggcat 
cgcgcggcca 
cgcacgcgcg 
aggcgatgcg 
accgcgacgc 
ggcgcgtcat 
aggtctcgca 
gcacggtgcg 
gtggggccag 
gatggacgag 
gatgcgcatg 
cggctgcggc 
cgccgacgtc 
ggccaatgcg 
cctcgtgcac 
ccgccgcctg 
gaagcgtatc 
gccgccgccg 
cgcgctggcc 
gatgaattcc 
cttccacgtc 
gccctaggcg 
cggggcgacc 
cccgtacagg 
tcccagcgcg 
ttgcaggttc 
cgcgcgcgcc 
aagcgcgtgc 
caggcgggcg 
gacctcggcg 
cgccgcgtgc 
accctcgcgc 
atgttcggcg 
cgcctcctcg 
cgacatcgac 
cgcgatgtgc 
gagcgggtcg 
caggccgatc 



ccttccccac 
ccccggaggt 
ggcgttcgac 
catgatcgag . 
gggcctgccg 
gcgcgtcgtg . 
ggccattcgc 
ccgcgacttc . 
ggtcacccgc 
gctggtcacg . 
cgtgctgatc : 
gcgctaccac 
ccaggcctcg 
cgtctccgaa 
cagcaccatc 
catcgaggag 
cgccaccgcc 
cggctacctg . 
cgcgtgggag 
gcccggctgg 
gctcttcgtg 
tgccacggtg 
ctgcggcagc . . 
cgccgagtac 
cgacccggcg 
gcggtacacg 
cgcggaggcc 
cgcgacccct 
gaagcgaagc , 
gtgcacgcgc . 
gtgggccgcc 
tccccctcga 
cgcptcgtgg 
tcgttcatcg 
ctgggcaccc 
tacgcggcga 
cagcctccac 
acgcaggcac 
tacccggcgg 
gagttcaccg 
tccttcgagg 
gccgccgcgg 
cgcacgcgcc 
cgctcgacct 
acggagggct 
tcatcgccgt . 
gcggccgcga 
cggtggtagt 
cttgcgtcgc 
gccggcacgc 
gacgagaagg 
cagagccgcg 
tcaccgcgaa 
gccgccccgc 
aggatggccc 
aggaccgcga 
agcgccccgg 



18780 

18840 

18900 

18960 

19020 

19080 

19140 

19200 

19260 

19320 

19380 

19440 

19500 

19560 

19620 

19680 

19740 

19800 

19860 

19920 

19980 

20040 

20100 

20160 

20220 

20280 

20340 

20400 

20460 

20520 

20580 

20640 

20700. 

20760 

20820^ 

20880 

20940 

21000 

21060 

21120 

21180 

21240 

2PL300 

21360 

21420 

21480 

21540 

21600 

21660 

21720 

21780 

21840 

21900 

21960 

22020 

22080 

22140 
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cgagctcggg 
cgcgcagccg 
tcagtgaatc 
cgccccacag 
tcagctcgac 
cgaggctcgc 
tgtcgtccat 
ccgcgcgatc 
ggaatcccag 
tccagtcgcg 
ccaggtccgc 
cgtggccggc 
cgcgcacctc 
cgcagtactt 
agcgcggctc 
acacgaagcg 
ccgcgatgcg 
cgaagccgag 
tggccacgct 
gcaactgccc 
ggatctcgaa 

tggtgcggcc 

cgaacgggcg 
cggtggccgg 
gcagccgccg 
cgaagcccag 
actccaccgg 
cgagcagcca 
cgaacgcggc 
cgcgcgggcc 
cgcccgggcc 
cgcccgccag 
catggaagat 
ccacgtcata 
gcacgcagac 
cgtcgaggat 
tgaaaaacgg 

gggggtgcga 

ccgttcatca 
ctgtcgccgt 
tcgcccgcgg 

tggcggaaga 

cacccgctgc 
aacaggtgcg 
aggtcgccca 
gtcaccgtgg 
aggtccgcct 
gccccgaaga 
cgtccgtcgg 
tggctcacga 
ccgttcatga 
aggtcgccat 
ttggccgagc 
ttctgcagca 
ggcggcgtga 
cagttggcgc 
aagttcgaga 



ggccgcgcgg 
gtagaggcgc 
gaacgagcgc 
ggtcgccgca 
gtcgccgaag 
cttgtactcg 
cagcgaggcc 
gtcgtagggc 
cttgcccagc 
cagcgccggc 
gtccgggcgg 
ggagaagatc 
gagacgctcg 
gtcgatcgcg 
ggcgcccttg 
cggccgcgaa 
ctcgaactgg 
cttcttgtac 
gcggtagccg 
catgtagggc 
ctcggtgtcg 
cggccgcgtc 
caggtcggcg 
gccgctcgcc 
cggcgcgacc 
gcgccgcacg 
ccagcggcgc 
ctgccggcga 
ccacgccagc 
atgcaggtac 
gaccacgcgg 
cgccacggtc 
ggcgagcagg 
gcccaccacg 
cagcgtgccg 
catgggacca 
cgaggcccgg 
ccgaccagcc 
ggcgcaccgt 
tcaggtccag 
tgatgatgga 
ccaggtcgtc 
cggccggcag 
cgcggccgtc 
ggtggcttac 
tgccgtccat 
tgccgtcgcc 
gctgggcgct 
tgtggcgcag 
tccagccgag 
tgtacgcgta 
tgaggtcggc 
cggtgatggc 
-cgaggtcgct 
cgccaccgga 
catccggggc 
aacgccacac 



aaggcggccg 
gcgcggtggt 
tcgacctcgc 
tgcgcggcca 
cgccgctcgc 
cgataggcat 
gtgagccgct 
agcgtggccg 
acgcggggtg 
acgttcgagt 
ccctgcgcct 
ggcacgtggt 
accagcgagc 
ccctggcgca 
tcgcccaggt 
tcgtcgagct 
cgcggcgcat 
acgcggtcgc 
tggcggcgca 
agcgcgcggc 
caggtgttgc 
gcgccgagga 
agcgactcca 
gagcgctcct 
gggcgggcga 
ttcacgttgt 
aagcgtccgg 
tcgtcgtacg 
gccgcgagca 
gccggcgcga 
ccgaagaacg 
cccaccgtcg 
ccggcgttcc 
aggcgccgct 
cccagcagca 
ccagggacgt 
gggcctcgcc 
gccggggccg 
gacgctgccg 
cagctgcgtc 
aacaccatcc 
cttcgcgtcg 
gagctgggcg 
ggtgtggcgg 
cgtccagccg 
gatgtagatg 
gttcatgtcg 
gccgatgatc 
gatgatgtcc 
cccggcgggc 
ggcgcggcca 
cgtctggctc 
ggtgccgttc 
cctgccgtcg 
aaccggcagt 
cggcgggttc 
cagcgfcgtgc 



ccgcgtcgcg 
aatagcccca- 
cgcccacgtt 
tcggctcgtc 
gcaacgcgag 
cgcggtcgcg 
ggtcgccgtg 
cgcgcagcga 
ccaggtcgtg 
agagccacaa 
cgaagatcgc 
cgccgaagat 
gcagcgcgcg 
cgtgcccggc 
agggtccgtg 
ccgcgatcac 
gggcgaacgc 
ggttgtagaa 
gcatgcgcgg 
agccttcctc 
cgccgatcac 
acggcagcgg 
tcatcaggaa 
ggcgctccag 
tgctcccggc 
ggtcggtggg 
cgacgaaggc 
cgtcgtgcag 
ctgcggcagc 
acaccagcgc 
cgagcaggcc 
cgagcgcccc 
aggccaggcg 
cgagggcggc 
cgccgaggaa 
cttccggcca 
gtggagggaa 
atcaggtttg 
tcggtgtggc 
acgttccaac 
atcaggcgga 
ccgttcaggt 
ctggcgccga 
aagatcatgt 
ctggcggccg 
tagccgcggc 
cccgcctggg 
gtcgtgccgt 
gccttgccgt 
agcagctcct 
tcggtgtggc 
accgtccagc 
atcgtccaca 
ccgctcaggt 
gccagcccgt 
caggtcggcg 
gcgcccgcgg 



ccccaccgcc 
gaacgggtcg 
gccgaggtcc 
ggggacgaag 
ctccagggcg 
gtcgtattcc 
aacggtgtcg 
cccgaacgcg 
ggcttccatc 
cggcgtgcgg 
cttccactcc 



cgccagcacg 
gtcggtttcg 
gaggaagtgc 
gcccatcagc 
ggcatccatc 
ctcgtcggcg 
ccagcggtgg 
caccgcatac 
gcccaatgcg 
cttcacgcgc 
gtcgcgcgcg 
gatgatgtcg 
cagccgctcc 
gaccagcacg 
cgaatacgtc 
gacgccgagc 
gtcgatggcc 
ggcaagcgcg 
ctcgcgagcg 
tgcgttcacc 
gaccgcgacg 
ccgcgcgcgc 
ctgcagcacc 
acccgccaaa 
gtctataacg 
ccgcgcaacc 
cgctgccgat 
ggaagaccag 
ccgtgcccgc 
tgtgcacgcg 
cgcccaccag 
aggtggtgcc 
ccgcgcgtcc 
agaggaagcc 
cgtcgctgtg 
tgagcgtcca. 
ccatcaacca 
cgccgttcag 
tgccggccgt 
ggaacaggat 
ccgcgccggc 
ccgcgatgcg 
ccgagacgac. 
cgatccaggc 
agttgcaagg 
tcaccgggaa 



gcgaggtcgc 
gggctcgcga 
acctggtgcg 
gcgcggtgga 
ttcagccgct 
tgcgcggcac 
tatgcgtagg 
gcgcggtcgc 
tcgcccgggc 
tagagctgcg 
gcgagtccct 
tagggccgcc 
gccatcaccg 
ggcgccacgt 
gagacgccga 
acggccgaat 
tggaacgccg 
aacggatgga 
acccggtcgc 
atgccggtga 
aggcggccgg 
agcggcacgc 
gggcgctccg 
acgccctcgc 
gcgtgcagga 
atcgcgtcga 
gcggcgagca 
tcggtgatct 
cgcaccgcca 
ctcaggaaat 
gcgtagagcg 
aagagcagcg 
gccacgggtt 
gcgtaggcga 
ccagccgtgt . 
ccccgcccca 
tcagggttgc 
ggtggcgagc 
gtccttgcgg 
cggcagcgca 
gccgtcggtg 
gttcaccgac 
gttcatcagg 
gtcgccgttg 
cgctccggcc 
catgaagacc 
gccggtcccc 
gatgtgcgcg 
gtccgccgtg 
caccgtagtc 
gtcggcgcgc 
gccgatcagg 
cccgtcgctg 
.cggcttggtg. 
gcggtcggcg 
cgtggagagg 
cgacaccagc 



22200 
22260 
22320 
22380 
22440 
22500 
22560 
22620 
22680 
22740 
22800 
22860 
22920 
22980 
23040 
23100 
23160 
23220 
23280 
23340 
23400 
23460 
23520 
23580 
23640 
23700 
23760 
23820 
23880 
23940 
24000 
24060 
24120 
24180 
24240 
24300 
24360 
24420 
24480 
24540 
24600 
24660 
24720 
24780 
24840 
24900 
24960 
25020 
25080 
25140 
25200 
25260 
25320 
. 25380.. 
25440 
25500 
25560 
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ttccagcccg 
tcgaacttgc 
agcagcgtgc 
gtcaccacct 
gtccatgcgt 
cacggcgaga 
cccgtcgcat 
gccaccggga 
gaggtcgcgg 
gagatggcac 
tccgcggccg 
accgtggact 
gcctgcgcgg 
ttggagaagc 
ttcgccgcgt 
gagccgttgg 
tacgagtacg 
gaggggcggt 
caaccgaaca 
ccgatccagg 
tccgcgccgt 
gcgtcgcccg 
acctgcgtcg 
ttggcgcgcg 
aagccgcgcg 
gcgagccgac 
cgacgcgcga 
gcatgctggc 
ggcggtactc 
cgafccgcgcg 
cgccgtgggc 
gctcgacctc 
gccgggcaac 
ggggaaccgc 
ctgtgatccg 
cgctatgtcg 
gggatgctcc 
gaactccttg 
ggccgatcaa 
gcccatcgtt 
gcgtcaccgt 
cgttcatcag 
cgtcgccgtt 
cggcgctggc 
ggcggaagac 
agccgctgcc 
atgcgtagcc 
tgtcgccggt 
tgatcgcggt 
gtcggccttg 
ggggaacagc 
cccgtcggtg 
tgatcacgct 
cgttcatggt 
cgtcgccgtt 
cggagaaacc 
acgtgcggcc 



aggccgtggt 
cggcgttggg 
ccgcgaacga 
gcgccgaacg 
tggcgccgcc 
ggatgatgcc 
tggagagcgc 
ccgtgatggt 
tgtagtcggc 
ccgccgagcc 
cgaaggtggt 
tgagcgccga 
tgccgcacgg 
ggtagacctt 
cgttcgtcgt 
gcagggccga 
cgaacgcacc 
cgtggcggtt 
ggcagccgga 
ctgcgccgtt 
attgcgtgcg 
gggtgatggc 
cattcaccag 
tgacgaggaa 
agtacacgat 
ggtattcacg 
cttcgagccg 
ctgctcggcc 
accctgcggg 
gtggttgatg 
ctggagcaac 
ggagcgcgga 
ggcgctgccc 
ggcgaaggcc 
ttgcgtgaat 
cggagatgtg 
aaaaggacta 
tggtctccgc 
gtccgcgctg 
gtggtggaag 
ccagcccgcg 
gcgcacgtgc 
gagatctccc 
gccgaacgtc 
gaggtcggcc 
gggcgcgagg 
gcgcccatcg 
gccgaccacg 
gccattcatc 
ccgtcgccgt 
tccttgccgc 
tgctggaagt 
ccagccggcg 
ccacgcggcg 
caggtcggtc 
cgtgagcaac 
ggccacatag 



gcccgcgtcg 
gaagctcgac 
gaggtcggag 
aaggctggtc 
cgaggtcgtg 
ggtcgcggtc 
gacgttgaac 
cttgggcgtg 
gcccgaagtg 
gccgatgcgg 
gccggtgaac 
gagtgccgcg 
ttgcggcgca 
ggtcgtcgag 
gcactcgggg 
gttgcaggtg 
ttgcggcggg 
gcccatggca 
agcaacggcg 
gccgccgaag 
gatggcctcg 
atgcatcgcc 
gcgcagcgtg 
gttcaggcgc 
catcaggtcg 
ccggggatgg 
gtcgtggcca 
gtcatgtcga 
gtcgagaaca 
ccgtcggtct 
tcgaagacat 
ctcagggcat 
ggcgccggca 
acggaagaga 
gcgttcatgt 
gacaaaaggt 
cgagggtgct 
gcgacttaag 
ccgatcgtgg 
accacgtcca 
aacggcggca 
gcgcggccgt 
accaggttca 
gtgccatcca 
ttctggtcgc 
aagcccacgc 
acgttggaga 
ttccatcccg 
agccagatgt 
tcaggtccgc 
ccacgaccgt 
agatgtcggc 
ccggcgggaa 
atgcgcccat 
gccgcatggg 
gtcttgtacg 
gccgcgcccc 



ctgaacacca 
acgcgatagg 
ttcaccgtcg 
gcgccctcga 
aagccggcag 
gccgggccgc 
gtctcgttgc 
gtctcgccgt 
gcggtgccgt 
ctcacggtga 
tgcaccgaac 
cggttgttgt 
ccgacacccg 
tcatgcagca 
cgctggctca 
aggttgccgc 
ctcgcggtgc 
tggccgagct 
taggcaaagt 
tcggcgccgt 
atgttgccga 
gtggtgtcgt 
atggcgacct 
gtcatcaggc 
atgacgttct 
cgatcagggt 
cggcgtggtc 
ccaggtagtc 
cgccgaagga 
tgctcttcgc 
acgagtggcg 
ggagggccgc 
tcgcggcggc 
agacgatcgc 
ggagctccgg 
atcgagcggg 
acggctgggt 
gttgcgaggg 
tgacgccgtt 
tgcgcccgtc 
ggatgtcggc 
cggtgtgcgc 
ccgaccatcc 
tcaggaacag 
cgctgaagtc 
cggcggtgat 
agacgatgtc 
agcccgcggg 
gggcgcgtcc 
cgtgtggctg 
gagcccgttc 
ctggctgtcg 
tcaggttcgc 
cggtgtgctg 
tcgggaacgc 
cgccgggcgt 
gcgcatcgac 



ccgtgccgtc 
cgaacgcgac 
tcgagttggt 
aggcctggtc 
gcagcgtgcc 
cgatgaagac 
cttcaccgat 
tggcccagct 
tggaggtggc 
gcgtcacgtt 
cgggaacggc 
tcagcgagag 
tcgtgcacga 
gcgacatgat 
cgccgcaggc 
tcgcgcaata 
cgcccgcttc 
cgtggatgaa 
tggggtccgg 
tgcgcatgaa 
acgtcgcggc 
cgatggcgtc 
cgctgtcggc 
tgccgccgag 
gcggcgtggg 
ctcggggctg 
ggcgccgagg 
gacggtgccg 
gccttcaggc 
gatccacgac 
caccccgttg 
ggcgttgaag 
gggtgcggag 
cagcgcgccg 
aagttgaccc 
cgtgacgacc 
tggtggcgaa 
aaccaccgac 
catcaggcga 
gccgttgaag 
ggcgccgagg 
gaacacgagg 
ggtgcccgcg 
gtgtgcgcgg 
cgcgacgtgg 
cgccgtgccg 
ggccttgccg 
cagcaggctc 
gtccgtgtgc 
atggtccagc 
atctggtacg 
ccgttggaat 
cgaaccagtg 
gaacaggaag 
caccttcgcg 
ggtcggatag 
gccgacgccg 



gatcaggaac 
gttgcccgcg 
cgggtcgctc 
ggagccgacc 
acccgtgggc 
acccgtggaLa 
accgtcgctc 
gagcgagccc 
gtagctgacc 
gccggcggtt 
gggcggggtg 
cgcgtcgttg 
gaggcccggg 

ggtggtg aa sr 

gccgccgccg 
gaagtggccg 
ccaggcctgc 
gacccactcg 
cgtgctggtg 
ggccaccatg 
gtcgaagctc 
gtcgtagttc 
atacgcagtg 
gcgctggccg 
cgtcacctgg 
ccggtcgcga 
ctgatgatcg 
cccggaatca 
ccggtcgtga 
gtgatgccgt 
gggagcgaca 
cgcaccgctt 
accaggatct 
aggaaggcgg 
atgcccaatc 
cgcccccgga 
ggccgaagga 
cacccaccgg 
atcgtcacgc 
tcgaggagct 
atcgcggtgc 
tccgcccggc 
ggcaacagct 
ccgtcgttgt 
ctcaccgccc 
ttcatgatgt 
tcgccgttca 
gcactgccgc 
tgagcagcag 
ccgtgccggc 
tgtagatgcg 
ccgcggccat 
atcgtcgttc 
tcgctctgcc 
aggaaggcat 
ttcgccgacg 
taccccacct 



25620 

25680 

25740 

25800 

25860 

25920 

25980 

26040 

26100 

26160 

26220 

26280 

26340 

26400 

26460 

26520 

26580 

26640 

26700 

26760 

26820 

26880 

26940 

27000 

27060 

27120 . 

27180 

27240 

27300 

27360 , 

27420 

27480. 

27540 . 

27600 

27660., 

27720 

27780 

27840 

27900 

27960 

28020 

28080 

28140 

28200 

28260 

28320 

28380 

28440 

28500 

28560 

28620 

28680 

28740 

28800 

28860 

28920 

28980 
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cgacgttgtt 
ggaacgcgtc 
cctgcgtgta 
tgfccgtaggc 
gcgtcacgaa 
tgggcaggcc 
cggtcacgtt 
caccgatctt 
acgtcgagga 
cgcggccgat 
acgagagctg 
gcgctgccgc 
ttgtccacgg 
atgacgttgc 
gtgcaatcgg 
gcgacgaact 
ccgagcgtgg 
tcgtacgtgc 
ttgaccggcg 
accgggtcga 
gcgccctgct 
acctgctgga 
aggttgccct 
cgcggatcgg 
acggtgaggt 
ttttcgcgcc 
tcctcggcct 
ctggcagctt 
ccggcctgcc 
aacgtgatgg 
gcgagggcgg 
ctgcgggtca 
ccacagctcg 
cccgcagtcg 
cctcgafccca 
ccgaggccac 
cgcgattgaa 
gcttctccgg 
gctcgggcgt 
cgggctcgag 
gacgcgcgcc 
tcgggatcac 
acaccacgag 
agatcgtcgc 
ggaccttcag 
ggaagaagtt 
gcgcgcagcc 
gcgccgcgca 
cgcagagccg 
agccctcgcc 
gccagcgctc 
cggcatcggc 
ggatgtcgaa 
cgcgctcgat 
gacggcccgc 
cgccctgcgc 
agccgggaac 



gccgccgatc 
ggcatcgccg 
gcccacgacg 
cgagccgccg 
cgcatcgacg 
ggtatcgccg 
gtcgccgctg 
ggtgacgaag 
gcgcgtggag 
gtcatcaccg 
gtgacgaacg 
cggtcgggaa 
cgatcgccat 
cagtggtatt 
tgacggcgaa 
cgttgccgcc 
tcgagtacac 
ctgcggtata 
tggtccactg 
tcaccagcgt 
cgcccacgaa 
aggctaccgg 
cgccatcgat 
cgcgcggggc 
ccacgccgcg 
ggccggcggc 
cgatgaccgg 
tcctggagcg 
cgcgcgagac 
gcgcgttcct 
ccgcgacgag 

tggggcagtg 

gatccagctg 
ggccccggcg 
gtcgccgtcg 
caccgggatg 
cgggttgtcg 
gtactgcgcg 
gcacaggccg 
caacttcagg 
acccgcggga 
gcgcacgtcc 
gcggtcgcac 
cgcatacgga 
cgcgcgtccg 
gtgcgcgagc 
cgtggcgtgc 
cggctggcgc 
aaccgcgtgc 
gcccatgcgc 
gcgcatccac 
gcgcgcatcg 
gcccgctgcg 
ctcgtgcagc 
cgcgacgctc 
gcgcggcgga 
gcgcacaccg 



agcgtcgaga 
ccggggccct 
agcgcatcgc 
aagatgcccg 
gtgccggcga 
acgacggtcg 
ccgccgagga 
ccgccaccgg 
cccgcgacgt 
ctgccgccga 
cgtcgcgatt 
gccgcgcgcg 
gccctcgtcg 
cagcgccgtg 
gcaatgcggg 
ggaggcgccg 
cacctgctgc 
ggtcgcgagc 
ctcgtcgccg 
gcgggtgctg 
gagctcgaag 
cgcgtgctgc 
ccacgcgtgg 
gaccacgaag 
atacaggtcc 
cgtggtgccg 
atcggaagcc 
tggctcgaag 
gtagagcgca 
gagggcatca 
cacggccagg 
accfcttctgt 
gcgatactac 
acaacggcgg 
cgggccagca 
ccgaggcagc 
cgcagcggtg 
atgggcacca 
aggaacaccc 
tcttccccat 



cggttcgcga 
ttcgcgccgc 
aacgccaccg 
ttgccgggcg 
tggcgcgcga 
acggcgtcgg 
tcgcccatca 
acgcgcacct 
gtgtcgttgc 
aacgccgggt 
gcctcttcgg 
gcgagcaacg 
cgcagcccga 
gcqtgcgcgg. 
acctcgcgcg 
gcgcccgaga 
ccggggccca 



acacgatgtc 
gcaccgggtt 
cattcggcgc 
acgattcgag 
aattgcgcac 
cctggcccgt 
acgtggaata 
gttgggtctg 
acgtgttgcc 
gatacgtgga 
gaggccggtg 
gccgaacccg 
tcgccggagc 
acgaatgcat 
aacgcgttgg 
agcgccagcc 
gtggtggggt 
ccgacgtaca 
gcgccgccca 
tcgtagtcgg 
cgcgcggcca 
gtaaattcct 
tcggcggacg 
tcgtattcga 
ttcagcgaca 
ctgtaatagt 
cccgcgaagc 
gcaatgccat 
tccgggccga 
gggacgacgg 
gggcggagcg 
tgttatggct 
cagcgagcga 
catcgcggtc 
cggcgggggc 
cgfcactcgag 
ccaccgcgac 
fcgtcgtgcac 
agtcgatctc 
gctgtttcgc 
ggccggccca 
ggccgaaggc 
cgcgcgcgat 
gcagttgcgt 
gcacctcgag 
cgtcgaggcg 
tcacgacctc 
cgccggagcc 
cgcgcggcgc 
ggtagtgcgg 
aaggaatgac 
ggaagtccgc 
ggcacaggtg 
agagcgccgc 
gcgatgcgta 
cggcccacgg 
gcaggtcggg 



gccgttgcgc 
cagggtcggg 
cagggcgacg 
cgcgccggtg 
cgttccggtc 
gaccgggttc 
cgcgaccggg 
gtacgcattc 
cgcattgtcc 
atagacgaag 
ccgacgaaca 
tgacatgcat 
cgcccagcag 
cgcgcccgcc 
tgccggtgga 
cgcggagata 
tgaacttcac 
cgttgccctg 
ggtacgtgga 
cgatcacgaa 
ccggcacgcg 
cctggcccac 
agaggtcgag 
gcgtgccctc 
cgcggccgaa 
ggctgagggt 
gcacgcgcaa 
cgcggctcac 
actggccctc 
catggacggc 
ccgggcgggc 
gcttgttgtc 
gtggttttfcc 
ctttgccagg 
atcgcggtag 
cagcttcagg 
gtcgagtgcc 
ttccgcggcg 
gcgatgcgtg 
ccccgcccag 
acgctccgca 
ctccgcgagc 
gcgctgggcg 
gagcaggtca 
cgaggtgagc 
ctgccactcg 
gacgcggccc 
ccagcggtcg 
gaagcgttcc 
atcgtcgtcg 
gagcttcgcc 
cgcgatgccg 
cgccatcgcg 
.gtcgpggttg 
gagtgccgcg 
cccgccgccc 
cgcgagggca 



acgcgagtga 
aagccggcga 
tccaggggat 
gccgagaagc 
tgcgggaagc 
accgcgatcg 
ccgccgctcg 
acgttgcgga 
gttgcgatac 
ccgccgttga 
cggctgcagc 
gaagccgttg 
ccgcgagaag 
ggcatggttg 
accggtgatc 
cgcttcggcc 
caccgtggct 
cgcgtccacc 
gaacgccatc 
gccggcctcg 
cgaatcgccg 
gcgcatgcgc 
gcggatctgg 
cttgccgtac 
gtgcggcacg 
ggcccggggc 
cagcgaggcg 
ggagacgcgt 
gttgcgctcg 
ggcgggagtg 
gccgctggag 
tttgcgggat 
gtgacccagt 
gcacgcacgg 
ggctcgagat 
ttgctcttcg 
agcgacgcga 
aacggcgcca 
gcgcgcacca 
cccacgcgcg 
tcgatcgcgt 
ggtgcggtgg 
atgtccgggt 
tccaggccca 
tggtagttgt 
acgcggttgg 
gcgcgctcca 
aacggaaacg 
acgggcccgt 
agcaccggct 
accggaacga 
gggcgcgcaa 
aacggccccg 

aggcgctcct 
gcgatttccc 
cccgtggagg 



29040 
29100 
29160 
29220 
29280 
29340 
29400 
29460 
29520 
29580 
29640 
29700 
29760 
29820 
29880 
29940 
30000 
30060 
30120 
30180 
30240 
30300 
30360 
30420 
30480 
30540 
30600 
30660 
30720 
30780 
30840 
30900 
30960 
31020 
31080 
31140 
31200 
31260 
31320 
31380 
31440 
31500 
31560 
31620 
31680 
31740 
31800 
31860 
31920 
31980 
32040 
32100 
32160 
32220 
" 32280 
32340 
32400 
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ggcctgcgat 
catcgacgat 
cggtttcgcc 
gccagcgttt 
cagcccttca 
atgaaggcgc 
gcagctggca 
tggtggcgct 
acgagcagct 
gcgagatcgg 
gcgcggcgca 
gtggggacga 

tagagaagct 
tcggccagcc 
gcgcggttga 
agggcgatcg 
gcatcgccgc 
aacaacggat 
tcgcgataga 
ttcaggttct 
gccacgatcg 
cgctcgggaa 
ctcaacacca 
cgaaggatgc 
ttcgccgccg 
cccgccagcc 
ccgaggaagg 
gcaatgccgt 
afcgcgcatcg 
gacgccgcgg 
acgatcgcct 
gcggcgcggc 
tacgagaagt 
aggcgcaggc 
agcagttcgc 
acgcggatgc 
aggcgctcgg 
gcgacgtcct 
gagtcgaggc 
gccgcggcgt 
acgcccgcga 
gcgtcgatgc 
tcgcagcacg 
ccgtagccgc 
accagcatgc 
gcgacgcagg 
gcgacgccgg 
cgcgcatcgc 
tccgaattga 
gcgaagccga 
ctgcccagca 
tgcgcggcgc 
ttgaagaccg 
atcgcgccag 
cgtaggcgcc 
gctcgtggcg 
ccgccgtcac 



gccctgctcg 
cgcgatccac 
gcccaccgga 
cggcgccctfc 
cgagcgcctc 
aaatcgtgcc 
ggaacgcggc 
cgcgcgcggc 
cgcgcagtgc 
tgacgcccgc 
gcaccgcgag 
acgccggatc 
cggcgttcgg 
ggtcgccctc 
cgcgcccgag 
cggcagtgct 
ccagcggcgt 
cgatcgttgc 
gcgccaggta 
ccgcgacgcg 
cgccgggatc 
tggcgccatc 
ggcagaaggt 
gcggcaggtc 
cttcgagcgc 
ggcgcgcggc 
ccacgcgcag 
gcccgatgac 
ccgcgtactc 
cgaagcgcga 
cgctgcggcc 
cgcacggcag 
cgtgcgccgt 
tgttccagcc 
ggaaggccgc 
cgggcgtcat 

tggtgaagtc 
tgcgcagcag 
gatgcagcac 
tgacgcgctc 
cgtaggtggg 
cggccacccc 
cgatctgcca 
ggccatagag 
agaagccgac 
cggagatggc 
gcaggctgag 
tgtcgcggca 
gcaccacgaa 
ggttcttcgc 
acccggcgag 
ccgtgtggcg 
ggacgacgac 
caggcgctgg 
tccatgcgct 
accggagacg 
cgcctcgcgc 



aggcgctcga 
agcgtcgccg 
accaggtgcg 
cgcgtcgaga 
gatgcacgcc 
ggccccgatc 
ctgcgtgcgc 
gtcgctttcc 
caccacggcg 
ccagccgccg 
cccggcgagc 
gtagcgttcg 
gttgcgtgcg 
ctccaccagc 
cgccgcgtcg 
cctggccggc 
gcgctcgcgg 
gggcacagca 
ctcggccgca 
atcgagcgcc 
gttgtgcggc 
gcgcgtgccc 
ctcctccatc 
cgtcgcggca 
gagatcgcgc 
ctcgatgatc 
caccggccgc 
gcgcagcttc 
ggacgggcag 
ttcgaggtag 
gcgcaggcaa 
gtcctgcggc 
gatgaccacg 
caccaacgac 
ctccacgccc 
gcgctcgggg 
ggtccacgcg 
gttcaccagc 
ctgcagcacg 
ggcggcgcgc 
ccagcggcgc 
gccgaacgag 
gtcccggtcc 
cgagtcgaac 
cgccgtcggc 
atcgacgtcc 
gatcgtcgcg 
acgcagcagg 
gtcgccgccg 
attggtgacg 
gcgcggatcg 
ggccagcgac 
tagcgccggc 
tgctcctgcg 
cccaggtgcg 
tcgtggtcgc 
tcgccgcaca 



gccagccggc 
cgttgcggat 
tgcgcacgcc 
tcgacgtaca 
accagccgct 
gcgcgcgccc 
gtgcggcgga 
tgcgcggcga 
ctcgcacggc 
gcgcgcagcc 
ggctggtgtc 
ccgtgcgcgc 
accgcacccg 
acgacccatt 
ctcgcgtcgc 
aaggcctcgc 
cggcgtggaa 
tccaccgcgc 
ttcgcctcga 
gtgcggtcgt 
agcaggaagc 
gccaccggga 
gtcgagggca 
taggcgccgc 
ggaccccagg 
aggtgcgcgc 
gcttcggggc 
gcttcgcagc 
aggatcacgt 
ccggtcacgg 
tccacgcatc 
ccggtcatca 
cgtgcccccg 
tggaagtgca 
gcgtcggcaa 
ttcaggcgcg 
cgcgggccct 
gcgcgcgtgt 
cgcgggcggc 
cgcagcgggt 
tcgaggcgca 
gcctcgcccg 
agcgcgcgca 
gggccgcatt 
aggcgcggat 
ggctcgccgt 
cggttggaga 
ccctcgagcc 
ctgccggcca 
acacgagtgc 
gtgctcgcgt 
gcgaggcagc 
gccatccgac 
agagtgcctc 
tgcgcgcctc 
ggtcggagag 
gggcgaggaa 



gcggaafctcg 
ggcgcgcgcg 
gcgcccgccc 
cgtggccgat 
ccggcgcgtg 
acgatgcggg 
agtgctccgc 
tgttcaggtc 
cctcctccgc 
cgccacgctc 
gcgcgaactc 
tcagggcatc 
ccaccgcgct 
cgcccttcgc 
gctcgaggac 
gcgccatcca 
agccgaggca 
catcgcgctc 
tcgtcttcgg 
cgcgcaggcg 
cggtctcgcc 
tgccgaggcg 
gcaggaccag. 
gctgcatgat 
cctcgatcgc 
ccttgcgccc 
gcggaagcag 
tcgcgccgaa 
cggccgcgtc 
acgccggcgc 
cgccatcggc 
tgttgtagtc 
actcacgcgc 
cgacgtcgta 
ccgccgcctc 
cgagfccgcac , 
cgggcagcac 
gctcctcaat 
ccggcatccg 
tcgcgatcgc 
cgcggttggc 
cgfcggtgcac 
tcgagaggtc 
gcacccacac 
actcgcggcg 
cgggccgctc 
gcgggcacac 
aaccggggcc 
ggcccgcagc 
cggcacgacg 
catcgacgag 
gttcggtgtc 
cggcgcttcg 
gagctgcttc 
ctcgagctgc 
cagcgagaag 
gtacatcgca 



ctgcaacgcg 
aacgcggagc 
agctcaccgg 
acgttcgcgg 
cgtcgcgtca 
cgcatcggac 
gaccacgcgc 
gaggcgatgc 
gaggcgcagc 
cagcaacgca 
cgcgccgaac 
actgcagccg 
cggcgcgcca 
gtccgcgatc 
cacggtctcc 
gtggtcgtag 
cgccgcggcg 
cgccgccagc 
gcggtggtac 
cagcaggagg 
atcgcgcacg 
ctgtgcctcg 
gtcgatgccg 
gcccgccgca 
gatgcgctgc 
gccgaagcgg 
cggcaggtcg . 
gcccttgcgc 
cacggcatcg 
cgtcggcccc 
gccgcgcgcg 
ggcgcacagg 
gatgcgcggc 
gcccccggac 
gtgcccgatc 
gtgcgtcgcc 
cagcgtcgag 
gccgccgcgc 
ctcgcgctcg 
ccacgcgcgc 
ctcgcgcgcg 
gtagacgtcg 
gttttcctcg 
ctcgcggcgc 
gctggcgcgc 
gaagatatcg 
gatgccgacc 
gacctgcgta 
gaccgcgcct 
cgccgcgaag 
gacgatgccg 
gtcgtggccg 
agctgggcca 
tgcgaggcga 
cgcagcgact 
cgcccggccc 
tcggcgacgc 



32460 

32520 

32580 

32640 

32700 

32760 

32820 

32880 

32940 

33000 

33060 

33120 

33180 

33240 

33300 

33360 

33420 

33480 

33540 

33600 

33660 

33720 

33780 

33840 

33900 

33960 

34020 

34080 

34140 

34200 

34260 

34320 

34380 

34440 

3.4500 

34560 

34620 

34680 

34740 

34800 

34860 

34920 

34980 

35040 

35100 

35160 

35220 

35280 

35340 

35400 

35460 

35520 

35580 

35640 

35700 

35760 

35820 



* 
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ccgccgcggg tttcgcagcc tccggcgccg aagcttgcag caactgcacg gcgtcgggca 35880 

ccgcgtgcag cggccagatc gccgaatacg catcgacgcg ttgcccgaac agggccacct 35940 

ccgggaaccg cgccttcagc acggcgagga attcgfcgctc gtagagctcg cgcacgtgat 36000 

gcgggttgcg gtagtcgcgc tggtccgagt acacctcgcg gttcggcgtc gagaccagca 36060 

gcaggccgcc gggcgccagc acgcgcttcg cttcgtcgag caggcgctcg ggatcgggaa 36120 

tgtgctcgag cgtctcgaag gagacgagga gatcgatgct tgcgtcatcg cacggcagcg 36180 

actcgcagcg gccctcgacg tactgcaggt tctgcgcggc accatagcgg cggcgcgcct 36240 

gcgcgatggt ctcggcggcg acatccgcgc ccaccacgga cttcgcgcgc gtcgccagca 36300 

gcgccgagcc gtagccctcc ccgcaggcga tgtcgaggac gcggcagcct ccggcgagcg 36360 

gcaacgcaaa gtggtagcgg tgccagtgct cgtaccagat ctcgccctcg aagccgggct 36420 

ggaaacgtfcc gtgttccatc gggagagtat aggtggatgt gcaaccgccc tccggagagg 36480 

gcggttgcga tgatcttatt cgagatcggc tgctacggaa gcaccggcaa cggctgcggc 36540 

ggcgccaccg accatccggt gcccgcgccc agcaggttcg cgctgccgag cgtggtgagg 36600 

ccgtccatca ggcgcaccgt gatgcggcca tcgacattcfc tgaacaccat gtcgagcttg 36660 

ccgtcgccat cgaagtcgag cagttgcgtg acggtccagc ccgcccccgc cggcagcacg 36720 

tccgccgcgc tgaggatggc ggtgccgttc atcaggcgca cgtgcgcacg gccgtcggtg 36780 

tggcggaaca cgatgtcacc gcgcccgtcg cggttcatgt cgcccacgtg gctcacgatc 36840 

cagccggtgc ccgcgccgag gagctcggaa ccggcgccga acgcggtgcc gttcatgatg 36900 

aagagatgcg cgcggccatc ggtgtggcgg aagaacafcgt ccgccttgcc gtcgccgctc 36960 

acgtcgccca ggtgcgtcac cgtccagccg ctggcgggcg agaggaagcc cgcgccggcg 37020 

gtgatggtgg tgccgttcat caggtagatg tagccgcggc cgtcagcgtg gatgaagacg 37080. 

atgtcgtccc tgccgtcgcc gttgaggtcc ccggtggcca cgactctcca gcccgtcgcc 37140 

ggccccagca gctgctggct gccgatgatg gccgtgccat ccatcagcca caggtgcgag 37200 

cggccgtcgg cgttgcgcag caggagatcc gccttgccgt cgccgttcat gtcggcggtg 37260 

cggtcgatgc tccagccgag gccggcgccc agcagcfccct tgccgccggt caccgtgagg 3 7320 

ccgttcatcg tgtacacgta cacgcgcccg tcggtgtggt gggatcctct agagtcgacc 37380 

tgcaggcatg caagcttgag tattctatag tctcacctaa atagcttggc gtaatcatgg 37440 

tcatagctgt ttcctgtgtg aaattgttat ccgctcacaa ttcacacaac atacgagccg 37500 

gaagcat 37507 
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